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^ ( 57 ) Abstract: The present invention relates to methods for designing new drugs useful against drug-resistant bacterial cells, viruses, 
mammalian cells and the like. The method involves identifying a target protein of the drug, selecting for drug-resistant variants that 
Q have an altered target protein (variant protein) by directed evolution, determining the three dimensional structure of the target and 
^ variant proteins and designing a new drug that can be effective against at least one drug-resistant variant The present invention can 
^ be used to predict future mutations that lead to drug resistance and the type of drugs that are effective to combat such resistance. 
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SMALL MOLECULE DESIGN AGAINST DRUG RESISTANT 
MUTANTS USING DIRECTED EVOLUTION 

5 FIELD OF INVENTION 

The present invention combines the fields of directed evolution and 
rational drug design through the use of three-dimensional structural analysis to 
provide methods for identifying molecules that can effectively combat drug 
resistance. 

10 

BACKGROUND OF THE INVENTION 

Antibiotic resistance is gaining in importance as a medical problem 
because more microorganisms are becoming resistant to greater numbers of 
antibiotics. For example, some strains of Staphylococcus aureus have developed 

1 5 resistance to several distinct antibiotics and are now described as having a 
"multi-resistant* * phenotype. Unfortunately, due to the extensive use of 
antibiotics, and sometimes potential misuse of antibiotics, such as the self- 
prescription of ciprofloxacin (CIPRO) prevalent in the Fall of 2001, such multi- 
resistant strains of microorganisms are now found in the general population and 

20 within hospitals. Hospital-acquired bacterial infections occur in 5% of patients 
admitted to the hospital (about 2 million patients per year in the United States). 

"New" antibiotics are often structurally related to or derived from a 
previous generation of antibiotics. For example, cephalosporin is structurally 
related to penicillin. While these structural analogs of known antibiotics can be 

25 successful for a time, reports of resistance become prevalent as the new 
antibiotic becomes widely used. 

The progression of ^-lactam antibiotic resistance illustrates this problem. 
Since the introduction of penicillin in the 1940's, microbes have evolved 
resistance to 0-lactam antibiotics, typically by acquired or enhanced catabolism 

30 of these drugs through the enzyme ^-lactamase found in many bacterial strains. 
While researchers have made successive changes to the basic ^-lactam antibiotic 
structure in order to make new antibiotics that are more resistant to degradation 
by p-lactamases, microorganisms have responded by producing different types 
or larger amounts of P-lactamases. Such resistance is particularly problematic 

35 for hospital-acquired infections. A common mechanism for maintenance and 
transmission of p-lactam antibiotic-resistance is via plasmids that can readily 
amplify. When p-lactam antibiotics are routinely prescribed, as often occurs in 
hospitals, a small population of bacteria that are highly resistant to such 
antibiotics can propagate and spread throughout the hospital. This problem has 
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been addressed by the production of new generations of antibiotics that are 
usually more toxic than their predecessors, and which often cannot be 
administered to patients in a convenient manner. For example, vancomycin, 
which is often called the "antibiotic of last resort," has so much toxicity that its 
5 use is restricted to the most dangerous, multiply drug-resistant, infections. 

Another therapeutic approach to avoiding antibiotic resistance has been 
to use p-lactamase inhibitors. Unfortunately, although the different classes of p- 
lactamases have overlapping substrate specificities, they have evolved 
differently and have distinct amino acid sequences and different three- 

10 dimensional structures. Thus, currently available P-lactamase inhibitors often 
have widely varying efficacies for the different p-lactamase enzymes. 
Moreover, even when the primary amino acid sequences of various P- lactamases 
are known, such knowledge does not allow researchers to predict the structure of 
successful p-lactamase inhibitors. For example, individual mutations at position 

15 164 and 238 of TEM class p-lactamases occur in 35% and 32% of all antibiotic- 
resistant mutants, respectively, but rarely in combination. An understanding of 
the three-dimensional structure of TEM p-lactamase, including mutants of TEM 
containing mutations at position 164 or 238, is needed to understand how these 
mutations lead to antibiotic resistance and how to best combat such resistance. 

20 Similarly, the primary amino acid sequence information for known P-lactamase 
mutants does not necessarily provide information about other types of mutations 
that are likely to occur in the future. Accordingly, new approaches are needed 
for the design of antibiotics and inhibitors of antibiotic resistance that 
incorporate an understanding of the three-dimensional structures of the target 

25 microbial proteins involved. 

Vims and cancer resistance is also a recurring problem for the medical 
community. The occurrence of drug resistance to new and old drugs is 
increasing. As therapeutic agents are used, the targets of their action evolve to 
develop resistance against them, lowering the probability for successful 

30 treatment of a large number of disorders involving microbial or viral infections 
and neoplastic transformations. 

For example, RNA viral diseases are responsible for the vast majority of 
morbidity and mortality for viral diseases in humans. Such diseases include 
AIDS, hepatitis, rhinovirus infections of the respiratory tract, flu, measles, polio 

35 and others. Acquired Immune Deficiency Syndrome ("AIDS") is a fetal human 
disease that has recently grown to epidemic proportions. Epidemiological 
evidence indicates that this disease is caused by die human immunodeficiency 
viruses, HIV-1 or HTV-2. HIV is particularly difficult to eradicate for several 
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reasons. For example, HIV permanently incorporates its genetic material into 
the genome of infected cells, it replicates (Perelson, A. S., Science 271:1582 
(1966)) and it mutates (Larder, B. A., Science 246:1 155 (1989)) at an 
exceptionally high rates and thereby avoids immune inactivation, and it 
5 specifically infects and destroys the very immune system components most 
critical for controlling the infection. There is currently no effective vaccine for 
prevention of infection, and aside from the recent success for combination 
therapies involving protease inhibitors, few treatments for this fatal disease exist 
Furthermore, the virus rapidly develops mutations conferring resistance against 

10 all chemotherapeutic agents tested to date. See U.S. Patent No. 6,063,628, 
herein incorporated by reference. 

All currently approved anti-AIDS drugs are designed to inhibit either the 
human immunodeficiency virus reverse transcriptase (HIV RT) or the virally- 
encoded protease or a combination thereof. Chemicals directed against HIV RT 

1 5 are either deoxynucleoside analogs that terminate HIV DNA synthesis or non- 
nucleoside analogs that inhibit the reverse transcriptase (Larder, B. A., J. Gen. 
Virol. 75:951 (1994)). The guiding concept of all current AIDS therapies is to 
prevent further replication of the virus by directing drugs to interfere with the 
production or maturation of viral-encoded proteins. Unfortunately, the 

20 therapeutic effectiveness of these drugs is generally rendered obsolete by the 
emergence of resistant viral mutations. It has been estimated that the mutation 
rate of HIV is one million times greater than the mutation rate of human cells. 
Recently, combinations of drugs have been shown to be more effective in 
reducing the viral load in individuals and it is hoped that this reduction in viral 

25 load will result in prolongation of life (Ho, D., Science 272:1 125 (1996)). 
However, the reduction in circulating virus may merely delay, rather than 
prevent, the development of drug resistant mutants in an infected individual. 

Drug-resistant mutations also lead to difficulties in treating cancer. For 
instance, it is believed that many solid tumors, such as breast cancer, progress 

30 from initiation to metastasis through the accumulation of several genetic 
aberrations. See, Smith et al> Breast Cancer Res. Treat., 18 Suppl. 1: S 514 
(1991); vande Vijver and Nusse, Biochim. Biophys. Acta, 1072: 33-50 (1991); 
Sato et aL, Cancer Res., 50: 7184-7189 (1990). Such genetic aberrations, as they 
accumulate, may confer proliferative advantages, genetic instability and the 

35 attendant ability to evolve drug resistance rapidly, as well as enhance 

angiogenesis, proteolysis and metastasis. The genetic aberrations may affect 
either recessive "tumor suppressor genes" or dominantly-acting oncogenes. 
Deletions and recombination leading to loss of heterozygosity are believed to 
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play a major role in tumor progression by uncovering mutated tumor suppressor 
alleles. See U.S. Patent 5,965,362, herein incorporated by reference. 

Mutation also plays a role in drug resistant leukemic forms of cancer. 
According to Wheldon and colleagues (Br. J. Cancer Suppl., 1 8: S13-9 (1992)), 
5 one mechanism for early onset leukemogenesis is the occurrence of a high 
cellular mutation rate. This mechanism is thought to enable both the rapid 
evolution of leukemic cells as well as the early emergence of drug-resistant 
variants. Upon application of chemotherapy, variant cancerous cells that are 
resistant to the chemotherapeutic agent administered are more likely to survive, 

1 0 and may eventually dominate, leading to an aggressive, non-responsive 

malignancy (see also Ling et al. 9 Cancer Metastasis Rev. 4(2): 173-92 (1985)). 
In another example, Zhao and colleagues identified four murine leukemia cell 
lines resistant to 5,10-dideazatetrahydrofolate (DDATHF) and other folate 
analogs by chemical mutagenesis followed by DDATHF selective pressure (J. 

15 Biol. Chem. 275 (34): 26599-26606 (August, 2000)). Each of the clones 

contained mutations in each allele of the folypolyglutamate synthetase (FPGS) 
gene, typically in portions of the gene coding for predicted ligand binding sites 
that have very high homology across species from bacteria to man. In each of 
the clones, the two alleles of the FPGS gene mutated independently, suggesting 

20 that more than one drug resistant variant of the same native protein having 
different molecular structures may be expressed in a patient simultaneously. 

Accordingly, new methods are needed for designing effective drugs to 
combat drug resistance. 

25 SUMMARY OF THE INVENTION 

The present invention provides methods for identifying new drugs and 
potential inhibitors and modulators of drug-resistant variants of a target protein 
of a drug of interest A drug-resistant variant according to the invention has at 
least one mutation resulting in a structural change, an activity change or a 

30 stability change as compared to the target protein. Such variants would include 
natural variants such as those encountered in the clinic, but preferably variants 
are selected by directed evolution methodology. 

The invention therefore provides a method for identifying a new drag 
effective against at least one drug-resistant variant of a target protein, 

35 comprising: selecting at least one drug-resistant variant of a target protein in the 
presence of a drug; obtaining the three dimensional structure of at least one 
variant site of interaction with the drug; obtaining the three dimensional structure 
of the target protein site of interaction with the drug; comparing the three 
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dimensional structures of the target and variant sites of interaction to identify 
structural similarities and differences; and designing a new drug to interact with 
at least one of the variants by using die structural similarities or differences. The 
three dimensional structures of the variants and target proteins disclosed herein 
5 can be obtained by available procedures, for example, by using x-ray 
crystallography, NMR, or molecular modeling. 

The method may further comprise contacting a new drug with one or 
more drug-resistant variants or with the target protein to test the ability of the 
new drug to interact with the drug-resistant variants and/or the target protein. 

10 An interaction may be indicated by assaying the function of the drug-resistant 
variant(s) or the target protein in the presence of the new drug, or by co- 
crystallizing the new drug and the drug-resistant variants) or target protein or 
fragment thereof. New drugs may be selected for their ability to interact with 
both the drug-resistant variants) and the target protein. Alternatively, new drugs 

1 S may be selected for their ability to interact with regions of drug-resistant 
variant(s) that are structurally different than said target protein. New drugs 
would also include those selected for their ability to inhibit die folding of one or 
more regions of said drug-resistant variants) or said target protein. New drugs 
may be selected from a database of small molecules or ligands, and may include 

20 a combination of molecules or entities. Preferably, new drugs are designed 
using any one or more of die methods selected from the group consisting of 
structure based drug design (SBDD), virtual ligand screening, ligand screening, 
high-throughput screening and ultra high-throughput screening. The drug or the 
new drug can an antibiotic, an enzyme inhibitor, a protein inhibitor, an antibiotic 

25 resistance inhibitor, a herbicide resistance inhibitor, an insect repellant, an 
insecticide resistance inhibitor, a viral drug resistance inhibitor, a 
chemotherapeutic resistance inhibitor, a modulator, an antagonist, an agonist, an 
effector, a ligand, an antibody, an antibody fragment, a peptide or a small 
molecule. 

30 Proteins to be targeted by the disclosed methods include, for example, 

bacterial proteins, viral proteins, microbial proteins, fungal proteins, insect 
proteins, cellular proteins contributing to the development of cancer and cellular 
proteins involved in inflammation or autoimmune disorders or diseases. Such 
proteins would include enzymes, receptors, transporters, ion channels, 

35 transcription factors, DNA polymerases, RNA polymerases, ribosomal proteins, 
kinases, receptors, tumor suppressor proteins, cell cycle proteins and DNA 
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binding proteins, to name only a few. A preferred target protein is Beta- 
lactamase. 

Also encompassed in the present invention are methods for preparing 
libraries representing drug resistance evolution profiles of target proteins or 
5 target protein fragments. Such a method may comprise the steps of (a) 
generating a first population of sequence or structure variants of said target 
protein by one or more rounds of a first method of directed evolution in the 
presence of said drug; (b) generating at least one other population of sequence or 
structure variants by one or more further rounds of said first method of directed 

1 0 evolution or by at least one other method of directed evolution or by obtaining a 
library of natural variant isolates; (c) comparing sequences or structures or 
fragments of said variants in said first population to the sequences or structures 
or fragments of said variants in said at least one other population to identify 
variants having unique sequences or structures; and (d) preparing a library 

1 5 comprising said unique sequence variants that represents a drug resistance 

evolution profile of said target protein or target protein fragment. Such libraries 
may be composed of drug resistant variants that are profiled either by their 
primary amino acid sequences or by their three-dimensional structures. Libraries 
may be screened to identify potential inhibitors or modulators as disclosed 

20 herein, or may be analyzed to identify regions in a target protein that may be 
structurally altered during the course of a given drug treatment so as to convey 
resistance to a given drug. 

The invention also provides a data structure for storing information on 
directed evolution of a series of drug resistant variants comprising: a series of 

25 separate variant data fields, each variant data field comprising each x, y and z 
atomic coordinate of each drug resistant variant; a target protein data field 
comprising each x, y and z atomic coordinate of the target protein; a structural 
difference data field comprising the differences in x, y and z atomic coordinates 
for each drug resistant variant relative to the x, y and z atomic coordinates of the 

30 target protein; and a series of drug resistance data fields, each drug resistance 
data field comprising each x, y and z atomic coordinate of each drug that is 
resistant to a drug resistance variant. 
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The invention further provides a computer-assisted method for 
identifying a new drug effective against at least one diug-resistant variant of a 
target protein, comprising: obtaining the three dimensional structure of at least 
one variant site of interaction with the drug by using computer processing to 
5 generate x, y and z coordinates for each atom in the variant site of interaction; 
obtaining the three dimensional structure of the target protein site of interaction 
with the drug by using computer processing to generate an x, y and z coordinates 
for each atom in the target protein site of interaction; measuring the differences 
in distance between the x, y and z coordinates for each atom in the variant and 

10 target protein sites of interaction; designing a new drug to interact with the site 
of interaction of at least one of the variants by using computer processing to fit a 
library of potential new drugs into the variant and target protein sites of 
interaction and determining which potential new drug fits into the variant site of 
interaction bettor than into the target protein site of interaction. 

15 The invention also provides an article of manufacture having instructions 

stored on it that cause a computing system to identify a new drug effective 
against at least one drug-resistant variant of a target protein, the instructions 
comprising: instructions for comparing a series of separate variant data fields 
with a target protein data field to generate a structural difference data field 

20 comprising the differences in x, y and z atomic coordinates for each drug 

resistant variant relative to the x, y and z atomic coordinates of die target protein, 
wherein each variant data field within the series of separate variant data fields 
comprises each x, y and z atomic coordinate of each drag resistant variant, and 
wherein a target protein data field comprises each x, y and z atomic coordinate 

25 of the target protein; instructions for generating a new drug data field from the 
structural difference data field and a small molecule data field, wherein die small 
molecule data field comprises x, y and z atomic coordinates for a series of small 
molecules and wherein the new drug data field comprises small molecules that 
can optimally fit within the site of interaction of the drug-resistant variant New 

30 drugs identified using the article of manufacture are also provided by the 
invention. 

The invention further provides a method for identifying a new antibiotic 
effective against at least one antibiotic-resistant of bacteria, comprising: 
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selecting at least one antibiotic-resistant strain of bacteria by exposing the 
bacteria to a selecting antibiotic; obtaining the three dimensional structure of a 
target protein site of interaction with the antibiotic, wherein the target protein is 
within the bacteria and wherein the antibiotic has its effect through the target 
5 protein; obtaining the three dimensional structure of a target protein site of 
interaction with the antibiotic, wherein the target protein is within the bacteria 
and wherein the antibiotic has its effect through the target protein; obtaining the 
three dimensional structure of a variant protein site of interaction with the 
antibiotic, wherein the variant protein is within at least one antibiotic-resistant 

10 strain of bacteria selected in step (a) and wherein the variant protein is resistant 
to the selecting antibiotic; comparing die three dimensional structures of the 
target and variant sites of interaction to identify structural similarities and 
differences; and designing a new drug to interact with at least one of the variants 
by using the structural similarities or differences. 

IS The invention also provides a system that includes a computer system; 

memory accessible to the computer system, a series of data structures stored in 
the memory and one or more software components operable on the computer 
system. A computer system is any type of device having at least a processor and 
accessible memory and associated networks and peripherals, such as local area 

20 networks (LANS) and printers. See e.g., Fig. 7. Examples of computers include 
personal computers, laptops, supercomputers, and any other kind of computer. 
The computer system is capable of operating according to instructions, software 
components, and data structures. 

In particular, die invention provides a system comprising a computer 

25 system; memory accessible to the computer system; a first data structure stored 
in the memory comprising three dimensional structural data of a target protein; a 
second data structure stored in the memory comprising a first population of three 
dimensional structural data for a series of variants of the target protein, wherein 
the variants were generated by one or more rounds of directed evolution of die 

30 target protein in the presence of a drug; a third data structure stored in the 

memory comprising a second population of three dimensional structural data for 
a series of second generation variants of a first variant, wherein the second 
generation of variants is generated by one or more rounds of directed evolution 

8 
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of the first variant in the presence of a drug; a fourth data structure comprising 
three dimensional structural data for a population of potential new drugs 
comprising antibiotics, enzyme inhibitors, protein inhibitors, antibiotic resistance 
inhibitors, herbicide resistance inhibitors, insect repellants, insecticide resistance 
5 inhibitors, viral drug resistance inhibitors, chemotherapeutic resistance 
inhibitors, modulators, antagonists, agonists, effectors, ligands, antibodies, 
antibody fragments, peptides or small molecules; a first software component 
operable on the computer system to compare the first data structure with the 
second or third data structure and to identify three dimensional structural 
10 variants that represent a drug resistance evolution profile of said target protein; a 
second software component operable on die computer system to compare the 
fourth data structure with the drug resistance evolution profile of said target 
protein and thereby to design a new drug effective against at least one drug 
resistance variant 

15 

DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a graph depicting die frequency of mutations obtained 
depending on the methodology employed, and illustrates that directed evolution 
can identify the same ^-lactamase extended-spectrum antibiotic resistant 
20 mutations as are observed in nature (clinical settings). Four mutation methods 
are compared: natural mutations, use ofDNA analogs, use of gene shuffling, 
and use of a mutator strain. The arrows indicate which mutations are ESBL 
mutations. 

Fig. 2 is a time-line illustrating trends that have been observed in 
25 lactamase mutation combinations and the order of mutation appearance. Early 
generation mutations are in italics whereas later generation mutations are in 
bold. See Orencia et al. Nature Structural. Biology 8(3):238-42 (2001); Stemmer, 
370 Nature 389-91 (1984); Zaccoic et al., 285 J. Mol. Biol. 775-83 (1999); 
Jacoby et al., http://www.tabev.org/stiidies/webt.htm (2000); Poyart et al., 42 
30 Antimicrob. Agents Chemother. 108-1 13 (1998); Medeiros, Clin. Invest Dis. 
24(Suppl. 1): 319-45 (1997). 

Fig. 3 provides alternative views of the TEM-52 three-dimensional 
structure. Fig. 3 A pinpoints the E104K/M182T/G238S combination of 
mutations (identified by numbers 104, 182 and 238, respectively) and the loop 
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mutations at positions 42, 92 and 241 . Catalytic residue positions are at 
positions 70, 166 and 1 82 and the B3 {J-strand is the strand running upward from 
residue 238. Figs. 3B and 3C provide an overlay stereoview of wild type TEM-1 
(darker) and TEM-52 (lighter) illustrating movement of loops 238-243, 267-271 
5 and 40-43. Figs. 3D and 3E provide a stereoview of the electron density for the 
TEM-52 active site. See Orencia et al. Nature Structural. Biology 8(3):238-42 
(2001). 

Fig. 4 depicts the structure of the global suppressor mutation Ml 82T 
found in TEM-52. Fig. 4a and 4b provide alternative views of TEM-52, 
10 illustrating the interactions relevant to the Thr 182 mutation. See Orencia et al. 
Nature Structural. Biology 8(3):238-42 (2001). 

Figure 5 is a block diagram of one embodiment of a data structure 500. 

Figure 6 is a block diagram of one embodiment of a computer-assisted 
method 600 for identifying a new drug effective against at least one drug- 
15 resistant variant of a target protein. 

Figure 7 is a block diagram of an example system 700 involving a 
computer system 702 for practicing various embodiments of the present 
invention. 



20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention utilizes a combination of directed or natural 
evolution and ligand screening and/or structural analysis to provide insight into 
die changes in protein conformation that lead to drug-resistance, as well as 
provide insight into die design of compounds, i.e., antibiotics and inhibitors, 

25 capable of combating drug-resistance by interacting with the enzymes and 
proteins responsible for drug-resistance. 

According to the invention, the design and selection of drug-resistant 
mutant proteins or enzymes or substrates can provide clues as to the molecular 
basis for drug resistance. The fields of directed evolution and rational design 

30 based on three-dimensional structure have been considered to be separate fields, 
with die proponents of each field being quick to dismiss the utility of the other. 
For instance, in a recent article by Tobin and others, site-directed mutagenesis of 
target residues based on structural data is described as resource- and labor- 
intensive, as well as time-consuming, with the authors stressing that evolution of 

35 new protein function may be studied in the absence of structural knowledge 
(Current Opinion in Structural Biol. 10: 421-27 (2000)). In contrast, in 
WO0159066, "Protein Design Automation for Protein Libraries," Dahiyat 
stresses the utility of computer modeling based on three-dimensional structure to 

10 
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identify small molecule ligands, but dismisses directed evolution because it 
ignores all structural and biophysical knowledge of proteins. 

The present invention goes further than any of the techniques of the prior 
art, in that it uses directed evolution or natural selection to obtain drug-resistant 
5 mutants, and then analyzes the three-dimensional structure of such mutant 

proteins or enzymes, etc., to predict the variants that will arise during die course 
of a drug treatment or that exist prior to treatment and are able to survive and 
dominate once a drug is administered. While others may have used either a 
directed evolution or structural approach, they have not realized that the 
10 combination of directed evolution and structural analysis leads to a better 

understanding of protein function. Moreover, none have taken such an approach 
and used the data accumulated to design new drugs and protein inhibitors that 
may be used to treat patients whose disease has become non-responsive due to 
drug-resistance. 

1 5 The invention allows the practitioner to predict and profile the evolution 

of resistance within target proteins in response to administered drugs. In 
addition, die invention provides methods for designing antibiotics, antiviral 
compounds, inhibitors, and small molecules against native and evolved targets. 
Such target proteins include bacterial and viral proteins and proteins involved in 

20 the development of cancer or immune disorders. In general, the present methods 
involve analysis of structure-activity relationships and/or the structural analysis 
of target proteins in the context of ligand screening to define the types of 
molecules that will most effectively interact with the target protein. Using the 
invention, key structural features of the target proteins are identified and 

25 predictions are made as to the optimal interaction of the target with molecules 
that may act as antibiotics and inhibitors. Particular target proteins of interest 
include drug-resistant variants that arise in the course of drug treatment The 
invention includes the prediction of such mutations by directed and/or natural 
evolution, which involves mutation of a target protein, observation of functional 

30 changes in the target protein, analysis of the primary and tertiary structural 

changes responsible for those functional changes and identification of specific 
chemical structures or molecules which have the appropriate size, shape and 
chemical properties to modulate the activity of the target protein. 

One embodiment of the invention involves a method for identifying a 

35 new drug effective against at least one drug-resistant variant of a target protein, 
comprising: (a) isolating or selecting at least one drug-resistant variant of target 
protein in the presence of the drug; (b) obtaining the three-dimensional structure 
of at least one variant or of the drug interaction site of the variant; (c) obtaining 

11 
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the three-dimensional structure target protein or of the drug interaction site of the 
target; (d) comparing the three-dimensional structures of the variants) and the 
target protein or of the drug interaction sites of the variant(s) and the target to 
identify structural similarities and/or differences; and (e) designing or selecting a 
5 new drug using the structural similarities and/or differences. 

Those of skill in the art will appreciate that the term "selecting" must be 
read in context For instance, "selecting" with regard to variants refers to the 
process of imposing selective conditions, i.e., growing cells that contain the 
target protein in die presence of the drug, in order to permit the outgrowth, 
10 identification and isolation of mutants or variants that are resistant to the 

selective conditions. In contrast, "selecting" in terms of drug design means, for 
instance, that a potential inhibitor may be chosen from any existing collection or 
database and in this regard may be considered to be "selected" rather than 
"designed." 

IS In order to select or isolate at least one drug-resistant variant of target 

protein, the organism or cells containing die target protein are exposed to a 
selecting drug. In general, the selecting drug is a known drug. For example, the 
selecting drug may be a known drug that has already been observed to give rise 
to drug-resistant variants or mutations. The target protein is the protein with 

20 which the selecting drug interacts. 

The drug can be, for example, an antibiotic, an antibiotic resistance 
inhibitor, a herbicide resistance inhibitor, an insect repellant, an insecticide 
resistance inhibitor, a viral drug resistance inhibitor, a chemotherapeutic 
resistance inhibitor, a modulator, an antagonist, an agonist, an effector, a ligand, 

25 an antibody, an antibody fragment, a peptide or a small molecule. A modulator 
is a compound that can interact with die drug-resistant variants) of die target 
protein, thereby rendering drug-resistant variant susceptible to inhibition by (he 
selecting drug or any other drug of interest In this instance, die drug-resistant 
variants) effectively becomes a target for the selecting drug or a "first 

30 generation drug resistant variant" Several "generations" of variants and of new 
drugs can be selected so that new generations of new drugs may be designed. 

The method may include further steps such as contacting a new drug 
identified by the method with either the drug-resistant variant or the target 
protein to test the ability of the new drug to interact with the drug-resistant 

35 variants) and/or the target protein. In this case, "interact" means bind to, e.g., 
by hydrogen bonding, ionic interactions, van der Waals or hydrophobic 
interactions, or fit into the structure of the variant or the target protein in such a 
way so as to form a combined structural entity or a combination thereof, where 
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the interaction of the new drug with the variant or the target protein preferably 
results in a change of function, i.e., inhibition of function or an influence on the 
biophysical properties (i.e. in the case of a modulator). Interaction of a new drug 
with drug-resistant variants and target proteins may be detected by "assaying" 
5 the function of the drug-resistant variants or the target protein in the presence of 
the new drug, for instance by growing bacteria or viruses or cells expressing the 
drug resistant variant in the presence of the new drug to determine if the new 
drug inhibits or decreases multiplication of the bacteria, viruses or cells. 
Alternatively, interaction of a new drug with drug-resistant variants or the target 

10 protein may be shown by co-crystallizing the new drug with the drug-resistant 
variants) or target protein. 

The steps of the methods described herein may be repeated to isolate 
further potential inhibitors or modulators. For instance, a method according to 
the invention may further include identifying a second generation new drug of at 

1 5 least a second generation drug-resistant variant of the first drug-resistant variant 
Such a second generation drug-resistant variant is found using the new drug 
designed or isolated using the first drug-resistant variant 

Such an extended method might include (a) selecting at least one second 
generation drug-resistant variant of the first drug-resistant variant by exposing 

20 the first drug-resistant variant to the new drug; (b) obtaining the three- 
dimensional structure of at least one second generation variant or fragment 
thereof and the three-dimensional structure of the first drug-resistant variant or 
fragment thereof; (c) comparing the three-dimensional structures of the second 
generation drug-resistant variants) and the first drug-resistant variant or the 

25 fragments thereof to identify structural similarities and/or differences; and (d) 
designing or selecting a second generation new drug using the structural 
similarities and/or differences. 

Other further methodology steps might comprise using SAR (Structure 
Activity Relationships) and/or another structure/function-based algorithm to 

30 design additional potential inhibitors effective against the drug-resistant 
variants) and/or the target protein. 

The steps of the method may be repeated in parallel with at least two, or 
about ten, or more preferably about 50, about 100, about 250, about 500, about 
750 or even about 1000 to about 2000 or more variants simultaneously, i.e., in 

35 high throughput fashion. Indeed, one of the particular advantages of the present 
invention is the ability to produce crystals in nano- or micro volumes, permitting 
simultaneous screening of multiple variants. Tools for performing die methods 
disclosed herein in high throughput fashion are known and available, for 
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instance as disclosed in U.S. Patent 6,296,673, which is herein incorporated by 
reference in its entirety. Any multiwell apparatus would be amenable to use in 
the parallel screening embodiments disclosed herein, i.e. 24-well plates, 96-well 
plates, 3 84- well plates, 1536-well plates, etc. 
5 The invention also encompasses new drugs of drug-resistant variant 

proteins and native target proteins that are isolated by the methods of the 
invention, including compositions comprising one or more of such inhibitors. In 
particular, the invention encompasses any drug, inhibitor or modulator that is 
targeted against (1) any drug resistant variant (2) a variant containing a novel 

10 mutation or (3) a variant containing novel mutation combinations that are 
isolated or determined by these methods. 

The invention encompasses pharmaceutical compositions comprising the 
new drugs of the invention. Such compositions can include one or more other 
known drugs to optimally interact with target and variant proteins. Preferred 

1 5 combined therapeutics would include the drug used to select the drug-resistant 
variants) used to design the new drugs of the invention, for instance to be 
administered in combination with the new drugs of the invention with the goal of 
preventing, reducing or delaying the development of resistance to the selecting 
drug. 

20 

Drug-Resistant Variants 

According to the present invention, drug-resistant variants are proteins 
that are structurally similar but not identical to the target protein. The structural 
similarities and differences are specifically generated or identified and structural 
25 information relating to those similarities and differences is used to design new 
drugs. 

Drug-resistant variants may be isolated from any available source by one 
of skill in the art. For example, the drug-resistant variants can be isolated from a 
patient who is or was undergoing treatment with the drug. A "patient" may be a 

30 human or other animal. Drug-resistant variants can be isolated or selected from 
an agricultural setting. Drug resistance involving the herbicides, insecticides, 
antibiotics and other drugs used in agriculture is a very big problem. Very often 
the same antibiotics are used in both humans and animals. For instance, a 
patient being treated with a particular drug for a bacterial or viral infection or 

35 disease may develop a subsequent non-responsive infection mediated by 
outgrowth of drug-resistant bacteria or viruses or cell population. A variant 
protein expressed or displayed by such bacteria or viruses or cell population that 
confers drug-resistance on such bacteria or viruses or cell population would be 
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considered to be a "drug-resistant variant" according to the present invention. It 
will be appreciated, however, that the methods could also be performed 
following the isolation of mutants that exist in the population without 
performing selection in the presence of the drug, for instance, by detecting 
S mutants using antibodies or by binding to other types of ligands. 

Drag-resistant variants may also be selected by at least one cycle of 
directed evolution. The phrase "at least one cycle" is intended to convey that 
repetitive directed evolution experiments may be performed on original variants, 
second generation variants, third generation variants etc. to isolate variants 

10 having more than one or two mutations. Directed evolution may be 

accomplished by any technique known to those skilled in the ait, including those 
selected from the group consisting of random mutagenesis, use of DNA analogs 
and DNA shuffling. Directed evolution may also be performed using a 
hypermutator strain of E. coli. Such methods are described in more depth in the 

1 5 disclosure to follow. 

Preferred drug-resistant variants have at least one structural mutation 
resulting in a functional change, an activity change or a change in the protein 
stability as compared to die target protein. Preferred drug-resistant variants may 
have a combination of two or more mutations resulting in a structural change as 

20 compared to the target protein. In addition, dnig resistant variants include 
mutations that confer resistance by stabilizing the overall fold of the protein. 
Drug-resistant variants would also include variant target proteins expressed by 
genes having one or more mutations in the signal sequence that lead to increased 
levels of protein expression, for instance by stabilization of the p re-protein prior 

25 to cleavage of the signal sequence. 

Drug-resistant variants would also include those with changes in 
conformation or modification due to the action of accessory proteins, or due to 
chemical modifications such as phosphoiylation/dephosphoiylation, 
suUation/desulfation, glycosylation/deglycosylation, presence/absence of metal 

30 ions/cofactors/substrates/products/ effectors/modulators, or due to changes in 
folding. Changes in folding can also be due to mutations in chaperone proteins 
or other proteins rather than the drug-resistant variant itself. In cases where 
structural changes, stability or instability are achieved through mutation of 
accessory proteins of the usual drug target, the resulting drug-resistant variants 

35 would still be important targets of new drugs based on structural differences, 
even though they acquire the drug-resistant phenotype indirectly through 
mutation of a gene encoding another protein. Likewise, such indirect drag- 
resistant mutants would still be a source of structural data for identifying 
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domains important for the design of new drugs, inhibitors and/or modulators that 
interact with either an indirect drug-resistant variants, a direct drug-resistant 
variant or a target protein. 

In this regard, the present invention also encompasses methods for 
5 profiling organisms to identify protein targets that are susceptible to conferring 
drug resistance to the host cells through mutation, including those proteins that 
do not directly interact with the drug of interest Such profiling may be 
accomplished, for instance, by comparing transcripts from drug-resistant cells to 
transcripts from susceptible cells to identify differences in die level of expression 

10 or the sequences of the transcripts expressed. Tools for accomplishing such 
profiling are disclosed in U.S. Patent 6,333,155, herein incorporated by 
reference, which discloses a method of comparing gene transcription induced by 
drug exposure to gene transcription induced by a known mutation to determine if 
the known mutation is in a candidate target of the drug. The present invention, 

15 in contrast, encompasses a method whereby the transcript profiles of drug- 
resistant and non-resistant cells are compared and used to identify transcripts 
originating from mutated genes, or transcripts that have increased in abundance 
due to the activity of mutated genes. Such profiling methodology may be used 
to identify further targets for the drug design methods disclosed herein. 

20 The drug-resistant variant proteins and mutant proteins identified in the 

present methods are also an aspect of the invention, as are the nucleic acids 
encoding them, including the genes, cDNA, mRNA, genomic nucleic acids and 
nucleic acid fragments thereof. Structurally homologous variants of these 
variants, Le. y genes encoding proteins containing mutations that do not change 

25 the amino acid sequence or lead to conservative substitutions that do not alter the 
structure of die variant, are also included, as are libraries comprising the 
variants, proteins and mutant proteins disclosed herein, or cells expressing the 
same. Libraries of mutant proteins may be used to test candidate drugs as 
disclosed in U.S. Patent 6,063,562, which is herein incorporated by reference. 

30 Alternatively, such libraries could be used in the parallel screening or high 

throughput drug design methodology disclosed herein. 

Variant proteins and second-generation or further generation variant 

proteins will find other uses than in die inhibitor design and screening methods 

disclosed herein. For instance, such variants could form die basis of vaccine 

35 compositions for administration to either humans or animals, for the purpose of 

raising a protective immune response against drug-resistant pathogens 

expressing or displaying variant proteins. Such drug-resistant variants may also 
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be used to raise antibodies (both for diagnostic and/or neutralizing purposes) and 
antisera using methods known in the art or may be used to identify other non- 
inhibitory ligands, which could then form the basis of diagnostic assays to be 
used to screen patients for the presence of drug-resistant cells, tumors or 
S pathogens. Kits based on such assays, for instance which include the drug- 
resistant variant proteins described herein and/or antibodies or ligands specific 
thereto, are also envisioned. 

Drugs 

10 A selecting drug as well as a new drug of the invention can an antibiotic, 

an enzyme inhibitor, a protein inhibitor, an antibiotic resistance inhibitor, a 
herbicide resistance inhibitor, an insect repellant, an insecticide resistance 
inhibitor, a viral drug resistance inhibitor, a chemotherapeutic resistance 
inhibitor, a modulator, an antagonist, an agonist, an effector, a ligand, an 

1 5 antibody, an antibody fragment, a peptide or a small molecule. 

New drugs of the invention may be selected for the ability to interact 
with a drug-resistant variants) and/or the target protein. In some instances die 
new drug interacts with structurally distinct regions of a variant protein that may 
not be present in the native target protein. Alternatively, a new drug, e.g. a 

20 potential inhibitor or modulator, may be selected for its ability to interact with 
regions of a drug-resistant variants) that are structurally similar to those in the 
native target protein or to bind to regions that are structurally critical to both 
target and variants. 

A target protein "region" or "interaction site'* according to the present 

25 invention can be an active site, allosteric site, regulatory site, global suppressor 
site, as well as any mutation site located in any region of the sequence or 
structure of a target protein that influences any biophysical property, i.e., 
activity, stability, expression level, or biological function, of these 
aforementioned regions. Such regions would include, but would not be limited 

30 to, domains and domain interfaces of a protein, and parts or fragments thereof 
A "domain" may be defined structurally as a compactly folded part of a 
protein that is linked to other domains by one or few other structural elements 
such as a loop or helix. A domain may also be defined dynamically as a 
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relatively rigid region in a protein that is separated from other domains by more 
flexible interdomain regions. A dynamical domain may be isolated or identified 
by calculating the large scale movements of certain points over the entire 
protein, either through normal mode analysis or by comparing two or more 
5 structures of the same protein that have domain movements. The parts of a 
protein that move in synchronous fashion are grouped in dynamical domains, 
with a domain "interface" being composed of regions in a domain that interact 
with regions in an adjacent domain: Such interactions are particularly important 
near die hinge regions between domains. The present invention allows for the 

1 0 identification of protein regions and domains and domain interfaces that are 
important for the development of drug resistance; therefore, once such regions 
are identified, a potential inhibitor may also be selected for its ability to inhibit 
the folding of one or more important regions or domains of the drug-resistant 
variant or the target protein. 

1 5 New drugs may be selected from a database of inhibitors or ligands, for 

instance using computer modeling and/or high throughput screening. Preferred 
databases for use in the present invention include databases of small organic or 
inorganic molecules. Libraries of small compounds which find use herein will 
generally comprise at least 2 compounds, often at least about 25 different 

20 compounds, more often at least about 100 different compounds, and usually at 
least about 200 to 300 different compounds. Such a library of organic 
compounds to be screened against the modified polypeptide of interest may be 
obtained in a variety of ways including, for example, through commercial and 
non-commercial sources, by synthesizing such compounds using standard 

25 chemical synthesis technology or combinatorial synthesis technology (see 

Gallop et al. 9 J. Med. Chem. 37: 1233-1251 (1994), Gordon et aL 7 J. Med. Chem. 
37: 1385-1401 (1994), Czaraik and Eilman, Acc. Chem. Res. 29:1 12-170 (1996), 
Thompson and Eilman, Chem. Rev. 96:555-600 (1996), and Balkenhohl et al y 
Angew. Chem. Int Ed. 35:2288-2337 (1996)) and by obtaining such compounds 

30 as degradation products from larger precursor compounds, e.g. known 

therapeutic drugs, large chemical molecules. A large number of small chemical 
compounds are readily obtainable from commercial suppliers such as Aldrich 
Chemical Co., Milwaukee, Wis. and Sigma Chemical Co., St. Louis, Mo. 
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It should be understood that, although the phrase "new drug" is written as 
a singular entity, a new drug may comprise a combination of two or more 
individual entities, i.e., at least two peptides or small molecules, where the two 
entities interact in combination or individually with the variant or target protein. 
5 New drugs may be selected using drug design with any one or more of the 
methods selected from the group consisting of structure based drug design 
(SBDD), virtual ligand screening, ligand screening, high-throughput screening 
and ultra high-throughput screening as discussed further in depth below. For 
instance, U.S. Patent 6,297,021 discloses methods for structure-based drug 

10 design and is herein incorporated by reference in its entirety. 

Preferred new drugs of the invention are inhibitors and modulators of the 
variant protein. Preferred new drugs of the invention are small molecules that 
generally are not large proteins or polynucleic acids. Compounds that find use 
in the present invention include, for example, aldehydes, ketones, oximes such as 

1 5 Oalkyl oximes, preferably O-methyl oximes, hydrazones, semicarbazones, 

carbazides, primary amines, secondary amines such as N-methylamines, tertiary 
amines such as N,N-dimethylamines, N-substituted hydrazines, hydrazides, 
alcohols, ethers, thiols, thioethers, thioesters, disulfides, carboxylic acids, esters, 
amides, ureas, carbamates, carbonates, ketals, thioketals, acetals, thioacetals, aryl 

20 halides, aryl sulfonates, alkyl halides, alkyl sulfonates, aromatic compounds, 
heterocyclic compounds, anilines, alkenes, alkynes, diols, amino alcohols, 
oxazolidines, oxazolines, thiazolidines, thiazo lines, enamines, sulfonamides, 
epoxides, aziridines, isocyanates, sulfonyl chlorides, diazo compounds, acid 
chlorides, and die like. 

25 Virtually any molecule that is capable of bonding covalently or non- 

covalently or by any other means to a drug-resistant variant and/or target protein 
as disclosed herein may find use in the present invention. 



Suitable Target Proteins 

30 Suitable target proteins for use in the disclosed methods include proteins 

selected from the group consisting of bacterial proteins, viral proteins, microbial 
proteins, fungal proteins, cellular proteins contributing to the development of 
cancer, cellular proteins involved in inflammation or autoimmune disorders or 
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diseases, and agricultural targets, e.g. herbicide resistance, insect resistance, etc. 
A suitable target protein of the present invention is any protein that is the target 
of a drug, Le. 9 antibiotic, chemotherapeutic, antibody reagent, herbicide, 
insecticide or any other inhibitor, and which may undergo structural changes 
5 either by mutation or other means to confer drug-resistance to the pathogen or 
cell in which it is expressed 

A target protein may be found in any pathogenic organism, but preferred 
target organisms include those selected from the group consisting of 
Haemophilus influenzae, Staphylococcus aureus y Streptococcus pneumoniae, 

10 Enterococcus faecal is, Escherichia co/i, Salmonella, Pseudomonas aeruginosa, 
Bacillus anthracis, Yersiniia pestis, Francisella hilar ens is. Brucella, Caxiella 
burnetii, staphylococci, Clostridium botulinum, Bordetella pertussis, Neisseria 
meningitidis Z2491, Streptococcus mutan, Klebsiella pneumoniae, Chlamydia 
pneumoniae, Chlamydia trachomatis, Heliobacter pylori, Mycoplasma 

15 pneumoniae, Legionella pneumophila, Legionella micdadei, Treponema 
pallidum, Toxoplasma gondii, Borrelia burgdorferi, Campylobacter jejuni, 
Listeria monocytogenes, Neisseria gonnorhea, Neisseria meningitidis, Rickettsia 
prowazekii, Mycoplasma genitalium, Mycobacterium tuberculosis, 
Streptococcus pyogenes, Vibrio cholerae, Trypanosoma cruzi, Cryptosporidium 

20 parvum, human immunodeficiency virus (HIV), cytomegalovirus (CMV), 
hepatitis C virus (HCV), herpes simplex virus (HSV), bovine diarrhea virus, 
human rhinovirus (HRV), hepatitis B vims (HBV), influenza virus, variola virus 
(smallpox), viral encephalitides, arena viridae virus, bunyaviridae virus, 
filoviridae virus, flaviviridae virus, and filamentous fungi (Fusarium, 

25 Myrotecium, Cephalosporium, Trichodenna, Verticimonosporium, Stachybotrys 
species). 

Potential cancer targets include cancer of the breast, prostate, throat, 
ovary, lung, brain, pancreas, stomach, and skin to name a few, as well as cancers 
of the blood including all forms of leukemia, lymphoma, myeloma, etc. 

30 Potential immune targets include proteins in cells involved in inflammatory and 
autoimmune diseases. Examples of immune, allergic and inflammatory disorders 
or diseases that may be targets of die present invention include general 
inflammation, skeletal-muscular disorders, osteoarthritis, gout, asthma, lung 
edema, adult respiratory distress syndrome, pain, aggregation of platelets, shock, 

35 rheumatoid arthritis, juvenile rheumatoid arthritis, psoriatic arthritis, psoriasis, 
autoimmune uveitis, allergic encephalomyelitis, systemic lupus erythematosis, 
acute necrotizing hemorrhagic encephalopathy, idiopathic thrombocytopenia, 
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polychondritis, chronic active hepatitis, idiopathic sprue, Crohn's disease, Graves 
ophthalmopathy, primary biliary cirrhosis, uveitis posterior, interstitial lung 
fibrosis; allergic asthma; and inappropriate allergic responses to environmental 
stimuli such as poison ivy, pollen, insect stings and certain foods, including 
5 atopic dermatitis and contact dermatitis. 

Preferred target proteins may be selected from the group consisting of 
enzymes, transcription factors, DNA polymerases, RNA polymerases, ribosomal 
proteins and DNA binding proteins, or any protein which constitutes or 
comprises a site for antibiotic or inhibitor action or a site for antibiotic 

10 resistance. For instance, target proteins may be selected from the group 

consisting of Topo IV B subunit (ParE), gyrase B subunit (GyrB), P-lactamases, 
penicillinases, cephalosporinases, chloramphenicol acetyltransferases, 
aminoglycoside modifying enzymes, D-ala D-ala ligase (ddl), Peptide 
deformylase, FabI, GlmU, MurA, MurB, MurC, MurD, FtsZ, Dam methylase, 

15 emr/mmr multidrug resistance, bacterial DNA methylase, squalene synthase, 
human immunodeficiency virus (HIV) reverse transcriptase, HIV protease, HIV 
integrase, HTV TAT/TAR, HIV NCp7, feline immunodeficiency virus protease, 
CCR5/CXCR4, cytomegalovirus (CMV) polymerase, HCV NS3 helicase, HCV 
NS3-4A (Ser protease), HSV protease, HSV TS, Bovine diarrhea virus NS5B 

20 (RNA synthesis), HRV 3C protease, HBV PPK (polyphosphate kinase), HMG 
Co-A reductase, Chitin synthase- 1, D- glucan synthase, inositol 
phosphorylceramide synthase (IPC), EF-2, neuraminidase, topoisomerase, 
thrombin, Factor Xa, cathepsins B, C, S and K, caspases, matrix 
metalloproteinases, Cyclin-dependent kinases, urokinase, EGFR tyrosine kinase, 

25 Janus kinase family (JAK), STAT, Bcl-2, receptor family tyrosine kinase, insulin 
receptor kinase, protein-tyrosine phosphatases, Src family kinases, Lck family 
kinases, ZAP/SYK family kinases, and TEC family tyrosine kinases, 
angiotensin-converting enzyme, monoamine oxidase and tetrahydrofolate 
reductase. A particularly preferred target protein is beta-lactamase, with the 

30 selecting drug being any penicillin derivative or beta-lactam antibiotic, including 
cefotaxime as described in the non-limiting examples included infra. 

Antibiotic Resistance Target Proteins 

As described above and used herein, target proteins include those 
35 proteins which constitute a site for drug, antibiotic or inhibitor action, or a site 
for drug or antibiotic resistance. Any protein known by one of skill in the art to 
interact with an antibiotic, whose three-dimensional structure and function can 
be determined, is contemplated. The present methods therefore permit 
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development of new drugs such as new antibiotics, new inhibitors, new 
modulators and new compounds to combat antibiotic resistance which can draw 
upon any information available to one of skill in the art about these antibiotic, 
inhibitor, and antibiotic-resistance mechanisms. 
5 As illustrated in the following table, many antibiotics have well 

characterized targets of action. Any of these targets provides a suitable target 
protein for use in the methods of the present invention: 



Antibiotic Family 


Example 


Target 


3-lactam antibiotics 


penicillins, cephalosporins 


cell wall biosynthesis 


Sulfonamides 


Sulfanilamide 


blocks synthesis of 
tetrahydrofolate 


Aminoglycosides 


streptomycin 


protein synthesis 


trimethoprim 




folate metabolism 


Chloramphenicol 




protein synthesis 


Vancomycin 




cell wall synthesis 



10 Mechanisms of antibiotic action are further described in the table below. 



Antibiotic Type 


Type of Action 


Effect 


Psicofuranine 


Inhibits xanthylic acid 
aminase 


Prevents synthesis of GMP, 
inhibits DNA synthesis 


Nalidixic acid 


Replicator 


Blocks DNA synthesis 


Griseofulvin 


Replicator 


Blocks DNA synthesis 


Mitomycin 


Binds complementary 
strands of DN A 


Inhibits DNA replication by 
inhibiting strand separation 


Actinomycin D 


Binds to guanine in 
DNA 


Inhibits RNA synthesis 


Rifampicin 


Binds to RNA 
polymerase 


Prevents RNA synthesis 


Rifamycin 


Binds to RNA 
polymerase 


Prevents RNA synthesis 


Streptomycin 

Neomycin 

Kanamycin 


Binds to 30S 
ribosomal subunit 


Inhibits protein synthesis by 
stimulating miscoding and 
inhibiting peptide chain 
elongation 


Tetracycline 
Edeine 


Binds to 30S 
ribosomal subunit 


Inhibits binding of amino acyl- 
tRNA to ribosome 


Puromycin 


Binds to 50S 
ribosomal subunit 


Causes premature release of 
incomplete polypeptide chains 


Chloramphenicol 


Binds to SOS 
ribosomal subunit 


Inhibits attachment of amino 
acyl-tRNA to ribosome 


Erythromycin 


Binds to 50S 
ribosomal subunit 


Inhibits protein synthesis 


Lincomycin 


Binds to 30S 
ribosomal subunit 


Prevents peptide bond formation 
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Antibiotic Type 


Type of Action 


Effect 


D-cycloserine 
vpxamycinj 


Inhibits racemase 

UalloiUIIIllIlg L IU 17 

alanine 


Prevents synthesis of peptide 

blUC wXialll Uil lIlUJallllL* awlU 


Vancomycin 

Risocetin 

Bacitracin 


Inhibits growth of 
glycopeptide polymer 


Prevents synthesis of murein 


Penicillin 
Meimcuiin 
Ampicillin 
Cephalosporin 


Inhibits 

uxinspepoaauon oi 
side chain to bridge 


Prevents crosslinking of chains 
oi giycopcpuue 


T «rAAA^A«%1%ftM 

Lysostapmn 


riyoro lyzes pepnue 
side chain and cleaves 

muramic acid- 

lUUHIIIUV HVIU 

glucosamine polymer 


Lysis oi siapnyiococci 


Tyrocidine 


Damage to membrane; 
decreases respiration 


Leakage of cell contents 


Gramicidin 


Uncouples oxidative 
phosphorylation; binds 
to membrane 


Leakage of cell contents 


Polymycin 
Colistin 


Releases proteins from 
membrane 


Leakage of cell contents 



Antibiotic resistance is commonplace. Many mechanisms for antibiotic 
resistance are known, including over-expression of target enzymes, expression of 
5 antibiotic inactivating enzymes, and mutation of a target protein so that it is no 
longer recognized by the antibiotic. Preferred target proteins of the present 
invention undergo structural changes that confer antibiotic resistance, for 
instance because of internal mutations or the action of other proteins. Examples 
of some mechanisms of antibiotic resistance are given below: 



Antibiotic 


Principle mechanism of resistance 


Penicillins and 
other P-lactam 
antibiotics 


Inactivation by ^-lactamase 


Sulfanilamide 


Mutation of dihydropteroate synthase 


Aminoglycosides 


Inactivation by aminoglycoside modifying enzyme, or 
by target mutation 


Trimethoprim 


Mutation of dihydro folate reductase 


Chloramphenicol 


Inactivation by chloramphenicol transacetylase 


Methicillin 


Mutation of penicillin binding proteins 


Vancomycin 


Mutation in target cell wall peptide 
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Inactivation of an antibiotic is probably the most common mechanism for 
drug resistance. For example, the most common form of resistance to (J-lactam 
antibiotics is the production of (^-lactamases, which degrade the antibiotic 
molecule. Other enzymes involved in drug resistance include penicillinases, 
5 cephalosporinases, choramphenicol acetyltransferases and aminoglycoside 
modifying enzymes. Such enzymes inactivate antibiotics by modifying them to 
inactive compounds. Other mechanisms which contribute to antibiotic resistance 
include drug permeability mutations, expression of transport proteins that 
actively extrude antibiotics from target organisms, and mutations in the cellular 

1 0 targets of antibiotics. 

Recently, two studies surveyed the genetic diversity present in isolates 
obtained from single hosts harboring the gastric pathogen Heliobacter pylori 
p.A. Israel et a/., Proa Natl. Acad. Sci. USA 98:14625-30 (2001); B. 
Bjdrkholm et ai y Infect. Immun. 69:7832-8 (2001)) . In both studies, multiple 

1 5 genetic variants of H. pylori were found, with some isolates having new genetic 
material, while other strains taken from the same host had missing genes. Thus, 
bacteria are in a constant state of genetic flux, especially when antibacterials are 
administered or during other potentially lethal challenges, such as survival in a 
host organism. Since new forms of many genes generally occur in all organisms 

20 (especially pathogenic microbes and viruses, as well as neoplastic cells) the 

approach outlined in the present invention would also be applicable to any such 
newly acquired or newly expressed genes that are structurally modified to 
convey drug resistance. 

25 

Anti-Viral Drugs 

The methods of the present invention are also applicable to the design of 
new drugs for the treatment of viral infections, and may be applied to any virus 
that has been isolated. Preferably, the virus is characterized by one of skill in the 

30 art with respect to gene expression, protein content and structure. In this 

embodiment, any viral protein that undergoes structural changes to confer drug 
resistance, or any cellular protein that undergoes structural changes to confer 
drug resistance to an infecting virus, may be a target protein for use in the 
disclosed methods. Such methods find particular utility in light of the fact that a 

35 reasonable estimate is that a significant number of infectious diseases found in 
developing countries are caused by viruses (J.N. Delgado & W. A. Renters, Eds 
"Wilson and Gisvold's Textbook of Organic Medicinal and Pharmaceutical 
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Chemistry," Tenth Edition, Lippincott-Raven, NY (1998)), as well as by 
bacterial infections. 

The viruses targeted by the present invention may be either DNA or 
RNA type viruses. Both types are obligate cellular parasites that depend upon a 
5 host for energy and the biochemical substances necessary for replication and 
protein synthesis. Thus, most existing drugs target virus-specific enzymatic 
processes required for the viral life cycle. Any existing drug or future drug may 
be used in the methods disclosed herein in order to predict future variants 
showing resistance to the drug. Two notable examples of successful antivirals 

10 developed with structure-based drug design include the influenza virus 

neuraminidase inhibitors targeted to block the release of mature virus particles 
from host cell sialic acid residues, and the HIV protease and HIV reverse 
transcriptase enzyme inhibitors. 

Structure-based drug design was used to develop die neuraminidase 

1 5 inhibitors oseltamivir and zanamivir (Tamiflu and Relenza) (Y.S. Babu et aL 9 J. 
Med. Chem. 43:3482-6 (2000)). Both of these drugs have been shown to be 
clinically effective in the prophylaxis and treatment of influenza A and B, and 
have demonstrated the development of minimal viral resistance in initial clinical 
studies (I.R- McNichoU et a/., Ann. Phannacother. 35(l):57-70 (2001); M.G. 

20 Ison et al 9 Cuit. Opin. Pharmacol. l(5):482-90 (2001); N.A. Roberts Prog. Drug 
Res. Spec No:35-77 (2001)). Although inhibitor-resistant forms of 
neuraminidase have been generated using in vitro passage in the presence of the 
drugs oseltamivir and zanamivir (M.T. Munell etaL, J. Virol. 75(14):63 10-20 
(2001); L.V. Gubareva et al, Antiviral Res. 53(1):47-61 (2002)), mutant forms 

25 of neuraminidase have started to appear in clinically-derived isolates (L.V. 
Gubareva et al, J. Infect Dis. 183(4):523-31 (2001)). Thus, new inhibitors 
specific for influenza neuraminidase are needed, and the methods of the present 
invention would find particular utility in analyzing the structure of existing and 
future drug-resistant variants so that new inhibitors may be designed. 

30 HIV protease and HTV reverse transcriptase have both been used as 

targets for structure-based drug design against HTV-1 retroviruses, although 
HIV-1 protease has been the most important target in the development of 
antiviral therapies against HIV-1 infection (Todd et al, Proteins 36: 147-156 
(1999)). The crystal structures of many wild type and mutant HIV proteases 

35 with and without bound inhibitors have been determined. This information has 
been very valuable and has led to development of the structure-based inhibitor 
Nelfinavir (also known as Viracept) (Kaldor et al y J. Med. Chem. 40:3979-85 
(1997)). In addition, many experiments have determined which HTV protease 
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10 



mutations are selected for during inhibitor monotherapy and combination 
therapy. Table 1 lists the mutations that are observed during protease inhibitor 
monotherapy, and there is a database available that lists sequence information 
for a large number of published HTV reverse transcriptase and HTV protease 
isolates, including inhibitor resistance information (http://hivdb.stanford.edu/). 

The methods of the present invention can be employed to analyze the 
three-dimensional structures of these isolates, and to design new inhibitors that 
target important domains of these enzymes with an understanding of how die 
proteins conform structurally to avoid the action of the drug. 

DRUG RESISTANT HIV PROTEASE MUTATIONS 
SAQUINAVIR RESISTANT MUTATIONS 



15 



G48V 



L90M 



LI 01 



L63P 



184V 



M46L 



♦A71V 



M361 



Location^) 



flap region 



adjacent to active site 



distant from active site 



distant from active site 



active site 



flap hinge region 



distant from active site 



distant from active site 



* putative global suppressor 



NELFINAVIR RESISTANT MUTATIONS 


MHtotionA 3.4) 


Location^) 






D30N 


active site 


♦A71V 


distant from active site 


184V 


active site 


M461 


flap hinge region 


* putative global suppressor 


AMPRENAVIR RESISTANT MUTATIONS 


Mutation^, 


jUocatipnft) 


I84V 


active site 


L10F 


distant from active site 


I50V 


flap region 


M46I 


flap hinge region 


147V 


flap hinge region 


D60V 


distant from active site 


RITONAVIR RESISTANT MUTATIONS 


Mutation (2.4.7) 


Location^ 


V82A 


active site 


154V 


Flap 


♦A71V 


distant from active site 


M46I 


Flap 


K20R 


distant from active site 


* putative global suppressor 
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10 Under native conditions, the dimeric HTV-protease molecule does not 

exist in a single conformation (Todd et al, 1999). The flap region located at the 
top of the protease binding site has very little structural stability, leading to many 
conformational states in the absence of substrate or inhibitor. The bottom of the 
binding site is the most stable part of the protein and contains the catalytic active 

15 site residues (residues 24-26). Upon drug, substrate or inhibitor binding, the flap 
region takes on a more ordered conformation. Todd et al, have shown that the 
effects of ligand binding and stabilization of die flap region are propagated 
throughout the molecule and most significantly to the bottom of the binding site 
(containing the catalytic residues). In contrast, the effects due to the interactions 

20 with the sides and the bottom of the active site are more local. 

Resistance to the clinically important inhibitors saquinavir, nelfinavir, 
amprenavir, and ritonavir have shown the development of resistance mutations 
in three "hotspot" areas (Table 1): residues M46, 147, G48, G49, and 150 are 
located in the flap region, a second group consists of residues R8, D25, G27, 

25 A28, D29, and D30 located in the bottom of the molecule near the active site, 

and the third group consists of residues P81, V82, and 184 located on the sides of 
die active site (Todd et al, 1 999). As expected, mutations at residues located in 
these hotspot regions have substantial effects on inhibitor resistance (Boden et 
al, 1998). Mutations located in the active site or within the flap region reduce 

30 inhibitor binding significantly but lead to protein destabilization and reduced 
viral reproduction (Jacobsen et al, 1996)(Bodene/a/, 1998). Compensatory 
mutations located outside of the binding site region stabilize the protease and 
improve viral reproductive capability (Markowitz e/ a/, 1995). Upon inhibitor 
challenge above certain thresholds, active site mutations have been shown to 

35 revert to wild type and are replaced with more efficient active site or flap region 
mutations (Mahalingam et al, 1999) (Partaledis et al, 1995). Thus, the 
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development of therapeutics containing a combination of different inhibitors is 
an important goal of treatment. 

The hotspot binding regions are structurally separated in the two HIV 
protease domains, and the two domains are linked by four strands. The first 
5 domain consists of a three p-strand sheet containing the flap region. The second 
domain contains the side and bottom regions of the active site, as well as the 
remainder of the protease molecule. Four strands link the two domains: residues 
32-42 (1 1 residues), residues 61-64 (4 residues), residues 66-73 (8 residues), and 
residues 78-83 (6 residues). The mutation A7 1 V is located on strand 66-73 

10 which connects the end of a 0-sheet in the first domain (flap region) with the end 
of a p-sheet in the second domain. 

In spite of the many resistance mutations that arise throughout the HIV 
protease molecule, any mutational effects that initiated from the first domain 
must still propagate to the second. In die position occupied by the A71 V 

1 5 mutation, the two domains are tethered together, preventing gross distortions 

between the two domains. Biochemical studies support this hypothesis of A71V 
stabilization. In addition, the A71V mutation is selected for in nelfinavir, 
ritonavir, as well as saquinovir resistance. Two additional mutations, one at 
position 84 in the active site region and one at position 46 in the flap region, are 

20 selected for in response to three or more inhibitors (Table 1). The A71 V 
mutation is the only mutation not located in the active site or flap region 
mutation that is also selected for in response to three or more different inhibitors, 
emphasizing its importance. Furthermore, die A71 V mutation is capable of 
reversing destabilizing mutations at multiple locations throughout the enzyme 

25 (Jacobsen et al (1996), Boden et al (1998), Patick et al (1996), and Markowitz et 
al (1995)), consistent with a possible role as a global suppressor mutation. Thus, 
a preferred HTV inhibitor of the present invention will be designed with 
consideration of the A71 V mutation. 

Utilization of a protease knockout of HIV-1 virus will enable the in-vitro 

30 evolution of the HTV-pro tease gene using methods of directed evolution (e.g. 
gene-shuffling, error-prone PCR). After rounds of evolution have been 
completed the protease gene will then be transfected back into the knockout 
virus in preparation for in-vivo selection. 
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In order to cany out the in- vivo selection human lymphoblastoid T-cell 
lines such as CEM-SS or MT-4 are used to propagate the HIV-1 virus 
transfected with evolved protease gene. The vims will be passaged in the 
selected lymphoblastoid T-cell line in the presence of a previously established 
5 HTV-protease inhibitor. After suitable adsorbtion period the cells are pelleted, 
washed, resuspended and plated on RPMI media plates containing a 
predetermined concentration of protease inhibitor. After a certain period of time 
lymphocyte colonies with no cytotoxic effect or syncytium formation were 
picked and cultured separately. The cell-free viral supernatant is collected and 

1 0 the protease gene is amplified from cellular DNA extracts by PCR and subjected 
to further rounds of evolution and selection. 

After the desired level of protease resistance is selected for the resulting 
protease genes are sequenced and cloned into an expression vector for in-vitro 
analysis and structure determination. 

15 In order to assay the activity of an evolved or drug-resistant HIV- 

pro tease in the presence of increasing concentrations of protease inhibitor the 
protease is combined with a solution buffered at an appropriate pH possibly 
containing dithiothreitol and EDTA. This stock solution is added to vials 
containing a predetermined concentration of protease inhibitor and allowed to 

20 incubate for a period of time. The reaction is initiated by the addition of a p- 
nitrophenylalanine substrate and incubated. After incubation the reaction is 
quenched with trifluoroacetic acid and the products quantified with HPLC or 
mass spectrometry. The data can be fitted to a competitive inhibition model and 
the Ki and IC50 determined. See Partaledis, J.A., et al. In Vitro selection and 

25 characterization of human immunodeficiency virus type 1 (HIV-1) Isolates with 
reduced sensitivity to hydoxyethylamino sulfonamide inhibitors of HIV-1 
aspartyl protease. 7. Virol 69(9), 1995 5228-5235. 

Anti-Cancer Agents 
30 Another important use of the methods described in this application 

involves the development of new small molecule anticancer agents that will be 
less amenable to the development of inhibitor resistant escapes when 
administered to a neoplastic cell population. A number of different signal 
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transduction pathways become deregulated in human cancers, leading to 
uncontrolled cellular proliferation. A number of different anticancer therapies 
exist; selected anticancer agents where resistance mechanisms have been studied 
are included in the following table. 

5 

Anticancer Agents 



Anticancer Agent 


Target & Mechanism 


Mechanism(s) of Resistance 


Methotrexate 
(Rheumatrex®) 


Antimetabolite (folic acid 
analog) that inhibits 
Dihydrofolate Reducatase 
(DHFR), butalso directly 
inhibitors folate- 
dependent enzymes of de 
novo purine and 
thymidilate synthesis 
pathways. 

Drug targets include: 
Dihydrofolate 
Reductase 

Thymidylate synthase 
Glycinamide 
ribonucleotide 
transformylase 


Mechanisms of resistance include: 
Impaired transport of 
methotrexate into cells 
Production of altered forms of 
Dihydrofolate reductase 
• (DHFR) that have decreased 
affinity for inhibitor 
Increased concentrations of 
intracellular DHFR 
Decreased ability to synthesize 
methotrexate polyglutamates 
Decreased thymidylate synthase 
activity 


Cytarabine 
(Cytosine 
Arabinoside) 
[Cytosar-U®] 


Antimetabolite 
(pyrimidine analog) that 
inhibits DNA synthesis 
Drug targets include: 
Deoxycytidine kinase 
Cytidine deaminase 
DNA Polymerase 


Resistance is poorly understood, but 
mechanisms of resistance include: 

Deficiency of deoxycytidne 

kinase 

Increased Cytidine Deaminase 
activity 

Reduced affinity for DNA 
polymerase 


Dacarbazine 

(DTIC) 

[Dtic-Dome®] 


Alkylating agents that 
target DNA synthesis 
enzymes including: 
Guanine-aOcyl 
transferase (DNA 
repair enzyme) 
Glutathione 
transferase 


Resistance is poorly understood, but 
mechanisms of resistance include: 
Decreased permeation of 
actively transported drugs 
Increased production of 
nucleophilic substances, such as 
glutathione that can compete 
with die target DNA for 
alkylation 

Increased activity of the DNA 
repair enzymes 


Imatirib Mesylate 
(Gleevec™) 


Drug target: 

Bcr-Abl Kinase 


Resistance caused by single point 
mutation, leading to reactivation of 
kinase activity. 


Doxurubicin 
(Adriamycin, 


Anthracylcine antibiotic 
that targets DNA and 
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Anticancer Agent 


Target & Mechanism 


Mechanism(s) of Resistance 


Rubrex) 
Mitoxantrone 


RNA synthesis 

Topoisomerase II 
Cytochrome P450 
reductase 





Recently, the successful small molecule drug candidate Gleevec was 
made using rational drug design, and has been shown to be an effective in vivo 
inhibitor of the Bcr-Abl tyrosine kinase responsible for the genetic malfunction 
5 present in chronic myelogenous leukemia (CML) (B J. Druker et al , Blood 
(Suppl.) 94:368a (1999);T. Schindler etal. t Science 289:1938-42 (2000)). 
Unfortunately, as is the situation with most conventional anticancer drugs, 
patients with advanced forms of cancer often relapse, with their tumor cells 
becoming resistant to the chemotherapeutics. In the case of Gleevec, a single 

1 0 point mutation in the Bcr-Abl kinase (Thr3 1 5 lie) was found in all relapsed 

patients, and was found to cause reactivation of the kinase activity. The methods 
as outlined herein can be used to predict new sites of mutation, as well as new 
mutant combinations that can arise upon application of inhibitor selection during 
a directed evolution procedure. The new Bcr-Abl kinase mutant proteins can 

15 then be used as targets for design of new inhibitors based on SBDD, VLS, or any 
of the ligand screening procedures applied to the new mutants or both the new 
mutant and wild-type protein targets. 

Inhibitors of topoisomerases are widely used in the treatment of cancer, 
including inhibitors of topoisomerase I (camptothecin analogs such as irinotecan 

20 and topotecan) and topoisomerase II (etoposide and doxorubicin). 

Topoisomerase drug resistance is commonly due to overexpressing P- 
glycoprotein or multidrug resistance-associated protein, however, some 
camptothecin (CPT)-resistant topoisomerase I (topi) proteins have been isolated, 
harboring specific topi mutations that are related to their CPT-resistance. For 

25 example, the R364H mutation is close to the catalytic tyrosine and other well- 
known mutations leading to CPT resistance (Urasaki et aL Cancer Res 
61(5): 1964-9 (2001). In addition, CPT-resistant cell lines with their 
corresponding mutations have been isolated for human prostate carcinoma cells 
DU-145/RC1 (mutation R364H), Chinese hamster fibroblasts DC3F/C10 

30 (mutation G503S), and human leukemia CEM/C2 cells (N722S) Urasaki et al. 
Cancer Res 61(2):504-8 (2001). Structure-based drug design has recently been 
proposed for wild-type and mutant forms of topoisomerase 
(http://dtp.nci.tuTi.gov However, the methods 

as outlined herein can be used to design new inhibitors of both the wild-type and 
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mutant topoisomerase proteins based on SBDD, VLS, or any of the ligand 
screening procedures applied to both protein targets. 

Some cancers result from the mutation of genes involved in the control of 
cell division, for example, tumor suppressor genes such as p53. One 
5 embodiment of the present invention would therefore involve the design of drugs 
that restore activity to proteins that normally act to control cell division, 
including p53, but that have mutated, thereby causing or contributing to 
malignant transformation or remission or recurrence of malignancy following 
drug treatment. A preferred novel drug according to the present invention would 
10 stabilize all mutant forms of p53, for instance, that would be predicted to arise 
during the course of a drug treatment Such drugs could be administered 
individually or as part of a combined therapeutic regimen. 

Directed Evolution Methods 

15 As used herein, "directed evolution** refers to the analysis of target 

protein structure and function by mutagenesis, phenotypic selection, screening or 
assessment, function evaluation, and/or sequence analysis. A "directed 
evolution profile** for a target protein is a collection of structural data comprising 
the structural changes that have occurred as the target protein evolves into a 

20 series of drug-resistant variants. Other data can be included in the directed 

evolution profile for a target protein such as protein activity data, drug-resistance 
data, protein stability data, frequency of mutation and the like. 

Several references disclosing directed evolution techniques are discussed 
below and are incorporated by reference in their entirety at die location of these 

25 citations. The molecular cloning techniques required for an underlying 

appreciation of directed evolution methodology are now commonplace, as those 
of skill in the art will appreciate. For a general review of cloning and expressing 
genes and analyzing mutations, reference may be made to Sambrook et al. 9 
Molecular Cloning, A Laboratory Manual 2nd ed., Cold Spring Harbor 

30 Laboratory Press, 1 989 (which is herein incorporated by reference for all it 
teaches and all the methods it references). 

Natural selection does not try out all possible combinations of mutations 
at random, but rather proceeds in a series of steps, each step leading to a 
phenotype that is more fit than the previous generation. The appearance of 

35 mutations in antibiotic resistance genes, viral genes, or cancer-related genes is 
selected for by the use of antibiotics, inhibitors, or other therapeutics in clinical 
settings. For example, natural evolution of (^-lactamase has taken place in 
response to an array of different penicillin derivatives, cephamycins, and four 
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generations of cephalosporins. Even though there are a large number of different 
antibiotics prescribed in the clinic, only 13 sites of mutation (out of 290 total 
residues) in the P-lactamase gene are sufficient to give rise to extended-spectrum 
p-lactamase antibiotic resistance. 
5 Directed evolution is a powerful tool to quickly identify key structural 

changes that can give rise to antibiotic resistance. As provided by the present 
invention, directed evolution experiments have determined mutation hotspots, 
have correctly predicted the clinical emergence of a three-mutant combination, 
and additionally have predicted the clinical emergence of even higher order 

10 combinations of these mutations. Structural analysis complements directed 
evolution in the methods of the invention. For instance, the present invention 
describes the structural basis of increased cefotaxime resistance due to die 
E104K/M182T/G238S combination of mutations. 

In general, mutations isolated early in the directed evolution process 

IS often act by reorganizing the active site topology, while later mutations can act, 
for example, as compensatory mutations. For example, the later-isolated Ml 82T 
mutation is a global suppressor mutation that is located in the hinge region 
between the two domains of p-lactamase. The M 1 82T mutation may act by 
stabilizing the proper orientation of the. active site residues. Therefore, according 

20 to die present invention, a new class of antibiotics or inhibitors can be designed 
to inhibit folding of the native p-lactamase enzyme conformation. 

Any method available to one of skill in the art can be used for generating 
mutations to perform directed evolution. Methods contemplated by die present 
invention include, for example, the use of natural selection, DNA analogs, DNA 

25 shuffling, hypermutator E. coli strains, and random mutagenesis methods. These 
methods can vary in the quantity and type of mutations that they produce, but all 
have the potential to mimic natural evolution. Accordingly, combinations of 
mutagenesis methods are also contemplated. 

Directed evolution methods can introduce mutations at rates of up to 

30 about 3 percent (Stemmer, 370 Nature 389-91 (1994); Zaccolo et al, 285 J. Mol. 
Biol. 775-83 (1999)). A mutation rate of about 0.1% is equivalent to about one 
amino acid mutation per one kilobase DNA. The directed evolution method 
using the incorporation of DNA analogs to introduce mutations has been found 
to mutate particular bases more frequently than others (Zaccolo et al (1999)), 

35 resulting in nonnatural and nonrandom distributions (Skandalis et al., 4 Chem. 
Biol. 889-98 (1997)). DNA shuffling (Stemmer (1994)) applies a low 
mutagenesis rate using polymerase error, and additionally provides for DNA 
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recombination, which is an efficient method to create new combinations of drug 
resistance mutations. 

Methods for performing mutagenesis by the DNA analog method are 
described in Zaccolo et ai (1999). Mutagenesis by the in vitro DNA shuffling 
5 method can be performed as described in S temmer ( 1 994), which is herein 
incorporated by reference. In one embodiment, the in vivo DNA shuffling 
strategy is used that takes advantage of mutagenesis using a hypermutator strain 
of E. coli by the methods described in Echols et al. 9 80 Proc. Nat'l. Acad. Sci. 
USA, 2189-92 (1983) and Low et al 9 260 J. Mol. Biol. 359-68 (1996). This 

10 procedure removes the PCR step necessary with in vitro DNA shuffling and the 
DNA analogs method. 

Prior to the first directed evolution study, there were nineteen naturally 
occurring TEM isolates (Stemmer (1994)). In the year 2000, there were over 
ninety TEM ({-lactamase mutants (Jacoby et al. 9 

1 5 htQ>^/www.lahey.org/studies/webthtm (2000)). The DNA analog and DNA 
shuffling methods have each identified mutations at five positions in wild type 
p-lactamase (TEM-1) that have also been obtained in clinical isolates (Fig.1). 
DNA shuffling also predicted three mutations at positions 42, 92 and 182 before 
their clinical isolation (Fig. 2). In a 1994 study, DNA shuffling predicted the 

20 E104K/M182T/G238S combination of mutations to be added to the TEM-1 
isolate, leading to a mutant with an observed 500-fold increase in antibiotic 
resistance relative to wild type TEM-1 (Stemmer (1994)). Four years later, this 
mutant combination was found in die clinical isolate .TEM-52 (Poyart etal., 42 
Antimicrob. Agents Chemother. 108-13 (1998)). This triple mutant was also 

25 obtained by directed evolution utilizing the DNA analogs method (Zaccolo 
(1999)), and additionally was obtained in the present study using in vivo 
shuffling with an E. coli hypermutator strain. The unexpected identification of 
the E104K/M182T/G238S combination of mutations using three different 
directed evolution techniques, and the additional isolation of this triple mutant in 

30 nature (TEM-52) highlights the importance of these residue positions in 
developing an antibiotic resistance phenotype and suggests that a preferred 
pathway exists for developing an increased resistance to P-lactam antibiotics. 

For instance, Fig. 1 is a graph depicting the frequency of mutations 
obtained depending on the directed evolution methodology employed as 

35 compared to natural isolates, and illustrates that directed evolution can identify 
the same ^-lactamase extended-spectrum antibiotic resistant mutations as are 
observed in nature (clinical settings). Eight of the eleven mutations obtained 
with directed evolution have been found in nature (indicated with arrows). The 
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highest frequency naturally occurring mutations at positions 104, 164, 238 and 
240 have all been identified by directed evolution experiments. In addition, 
three different directed evolution methods identified the E104K/M182T/G238S 
combination of mutations, which is also present in the clinical isolate TEM-52 as 
5 discussed above. Frequencies of clinical mutations were derived from the data 
provided in the Jacoby website database at 

http://www.lahey.org/studies/webt.htm (2000). Frequency values were derived 
by determining the number of times a specific mutation appeared in the 
extended-spectrum p-lactamase mutant population and dividing by the total 
10 number of extended spectrum ^-lactamase mutants (56 total; for this number, 
there were 80 mutants total minus 24 inhibitor resistant mutants). Directed 
evolution frequencies were calculated by determining the number of occurrences 
of a specific mutation in the experiments analyzed, divided by the total number 
of isolates analyzed. 

IS Fig. 2 is a time-line illustrating trends that have been observed in 0- 

lactamase mutation combinations and the order of mutation appearance. As 
mentioned above, a DNA shuffling experiment in 1994 predicted the mutation 
combination E104K/M1 82T/G238S as well as predicting mutations at positions 
42 and 92 before their appearance in the clinic (these mutations were observed in 

20 1998 (triple mutant), 1996 (position 42 mutant) and 1999 (position 92 mutant)). 
The evolution of clinically isolated resistance mutants closely mimics the 
pathway described by the three directed evolution studies. 

In addition to mutational 'hotspot' identification (Fig. 1), another 
powerful use of directed evolution studies is prediction of new, hitherto 

25 unobserved mechanisms that can lead to drug resistance. Such mechanistic 
predictions permit more effective design of antibiotics and design of inhibitors 
with a reduced tendency to developing resistance to them. According to the 
present invention, such mechanistic predictions can be made by isolating and 
studying the order of individual structural and functional changes produced by a 

30 sequential mutation and selection scheme. 

Thus, according to the present invention, a number of mutations have 
been found to characterize the extended substrate spectrum of TEM-type 
lactamases. An R164S or R164H substitution leads to high resistance to the 
antibiotic ceftazadime, but low levels of resistance to cefotaxime (Jacoby et 

35 a/.(2000); Giakkoupi et «/., 45 J. Antimicrob. Chemother. 101-04 (2000); Knox, 
39 Antimicrob. Agents Chemother. 2593-2601 (1995); Matagne et al. f 330 
Biochem. J. 581-98 (1998)). The G238S mutation confers high resistance to 
both ceftazadime and cefotaxime, whereas mutation at either position 69 or 244 
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confers resistance to P-lactamase inhibitors such as tazobactam or clavulanate 
(Knox (1995)). However, the R164S and G238S combination of mutations are 
deleterious to extended-spectrum p-lactamase activity (Giakkoupi et al. (2000)) 
and are only encountered in one natural isolate, and only then in the presence of 
5 the stabilizing mutation E104K (Jacoby et al. (2000); Raquet et al., 23 Proteins 
63-72(1995)). 

In a scheme proposed by Giakkoupi et al. (2000) and Du Bois et al. 9 35 J. 
Antimicrob. Chemother. 7*22 (1995), the most likely sequence of mutational 
events is first R164S, then E104K, and then G238S. In fact, this trend was 

10 detected by die DNA analogs experiment using different levels of cefotaxime 
selection (Zaccolo et al. 9 (1999)). Low levels of selection produced a R164S 
mutation, increased levels of selection pressure produced the R164S/E104K 
combination of mutations, and further mutation of this isolate combined with 
selection at high cefotaxime concentrations resulted in position 164 reverting to 

15 wild type and the isolation of the E104K/M182T/G238S combination of 
mutations (TEM-52) (Fig. 2). 

In addition, Long-McGie et al. y 68 Biotechnol. Bioeng. 121-25 (2000) 
reported on a related phagemid-based in vivo directed evolution method using 
TEM-1 ^-lactamase, where they obtained a triple mutant similar to TEM-52. 

20 Their mutant contained the same two early mutations as found in TEM-52 
(E104K/G238S) along with a third mutation at T265M, allowing for a mutant 
displaying a similar increased cefotaxime resistance. These results lend further 
support to the importance of mutational hotspots in the evolution of extended- 
spectrum antibiotic resistant P-lactamases. 

25 DNA shuffling has further shown that the addition of one signal 

sequence mutation (A 18V) to TEM-52 increased cefotaxime resistance by 
16,000-fold (Stemmer (1994)). Three substitutions (A42G, G92S and R241H) 
to TEM-52 were found to increase resistance 32,000-fold. This ensemble of six 
mutations has not yet been isolated in nature, but an alarming observation is that 

30 individual mutations at five of the six positions have been observed in different 
clinical isolates. Residue positions 42, 92 and 241 are all located in loop regions 
of TEM-52, and possibly act to stabilize the reorganized active site shape (Fig. 
3a). The stepwise accumulation of mutations observed during the emergence of 
extended-spectrum antibiotic resistance makes it highly probable that higher 

35 order combinations of the predicted mutations will be clinically isolated. 
Therefore, understanding the development and mechanism of antibiotic 
resistance conferred by the TEM-52 mutations, as well as determining the 
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location and role of future mutations predicted to be incorporated in the TEM-52 
enzyme architecture, are of utmost importance. 

Structural Analysis 

5 Any method known to one of skill in the art for localizing an active site 

or a functional domain within a protein may be used For example, the primary 
amino acid sequence of a protein can be ascertained by DN A and/or protein 
sequencing methods. The amino acids which are present in the active site are 
those that play a role in the function of the target protein. Such key amino acids 

10 can be identified by observing which amino acids are responsible for phenotypic 
changes when the target protein is mutated. Similarly, homologies to known 
"consensus" protein sequences can identify an active site of a target protein. 
Thus, the primary amino acid sequence of a target protein can be compared to 
that of known, conserved amino acid sequences which have a known function. 

1 5 Homologous target protein domains can have the same function as that of the 
"consensus" protein domain. 

However, according to the present invention, in addition to information 
about the primary amino acid sequence, three-dimensional structural information 
is also needed to optimally design compounds which can interact with and 

20 modulate the activity of die target protein. For three-dimensional structural 

analysis, proteins may be expressed and purified using any technique known to 
one of skill in the art. For example, see Sambrook, J. et al y Molecular Cloning: 
a Laboratory Manual, pp. 17.37-17.41, Cold Spring Harbor Laboratory Press 
(1989); and Rudolph, R. et aL y FASEB J. 10:49-56 (1995). Three-dimensional 

25 structural analysis can also be performed by any method known to one of skill in 
the art For example, such three-dimensional structural analysis can be 
performed by NMR spectroscopy, electron crystallography or X-ray diffraction 
crystal analysis. Preferred structural analysis is by X-ray diffraction crystal 
analysis, with crystals effectively diffracting X-rays for the determination of 

30 atomic coordinates to a resolution of better than about 5.0 Angstroms, and more 

preferably better than about 3.5 Angstroms, and even more preferably better than 

about 2.5 to about 2.75 Angstroms. 

Perhaps the best known way of determining protein structure is X-ray 

crystallography. See Physical Biochemistry, Van Holde, K. E. (Prentice-Hall, 
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New Jersey 1971), pp. 221-239, and Physical Chemistry with Applications to the 
Life Sciences, D. Eisenberg & D. C. Crothers (Benjamin Cummings, Menlo 
Park 1979). With X-ray crystallography it is possible to elucidate three- 
dimensional structure with good precision. Protein structures can also be 
5 determined by neutron diffraction and nuclear magnetic resonance (NMR). See, 
e.g., Physical Chemistry, 4th Ed. Moore, W. J. (Prentice-Hall, New Jersey 
1972), and NMR of Proteins and Nucleic Acids, K. Wuthrich (Wiley- 
Interscience, New York 1 986). Proteins may also be elec trocrystal 1 ized for a 
determination of three-dimensional structure, for instance as described in U.S. 

10 Patent 5,597,457, which is herein incorporated by reference in its entirety. A 
preferred method of crystallization may be found in U.S. Patent No. 6,296,673 
which is herein incorporated by reference in its entirety. 

The three-dimensional structure of some proteins may also be obtained 
from public databases. For instance, protein structures of atomic resolution are 

1 5 catalogued by Brookhaven National Laboratory and made available to the public 
via the Internet, eg., at http://www.rcsb.org. The Protein Data Bank (PDB) is an 
archival computer database of three-dimensional structures of biological 
macromolecules. The database contains atomic coordinates, bibliographic 
citations, primary sequence and secondary structure information, as well as 

20 crystallographic structure factors and 2D-NMR experimental data. Information 
is available on protein, DN A, RNA, virus and carbohydrate structures. For 
instance, die atomic coordinates of beta-lac tamase TEM52 are available under 
PDB Accession code 1HTZ, which is herein incorporated by reference. 

In summary, the structure of the drug resistant target proteins and 

25 variants can be accomplished as follows. 

Single crystals of each drug resistant protein are be mounted on an x-ray 
source such as a rotating anode x-ray generator or a synchrotron. Diffraction 
images of die protein in question will then be collected utilizing die technique of 
oscillation photography. The data collected by the above method will then be 

30 processed to determine crystal symmetry, unit cell parameters, and other 

necessary data collection parameters with the programs Denzo, Mosflm or die 
D*trek suite. 
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Once the above parameters have been determined and refined, Denzo, 
Mosflm or the D*trek suite, will be used to measure the integrated intensity of 
all the reciprocal lattice points contained in die diffraction images collected. 
This data will then be scaled and reduced by combining redundant measurements 
5 using any of the programs, Scalepack, Scala, or the D*trek program 

dtscaleaverage. In this way a list of reciprocal lattice points and their associated 
miller indices will be generated along with each points average integrated 
intensity and associated error. This list of data points will then be processed 
further, for example, by using the CCP4 suite of crystallography programs. In 

10 this case the data will be passed through the program TRUNCATE to convert 
integrated intensities to structure factor amplitudes, then through the program 
CAD to ensure the data represents the correct area of reciprocal lattice space. 

Phase information will be obtained in the following manner. If a 
structure homologous to the one being solved exists phase information will be 

1 5 obtained by the method of molecular replacement In this method die orientation 
and position of a search model will be optimized in order to achieve the best 
possible fit between data derived from the search model and experimental data 
collected above. This procedure will be carried out with any one or more of the 
following programs: AMORE or MOLREP in the CCP4 suite, or with programs 

20 such as EPMR or any other program capable of fitting search model to 

experimental crystallographic data. Once obtained the phases will be refined by 
first building a molecular model of the protein structure into the observed data 
either manually with the program O or XT AL VIEW, or automatically with 
programs such as ARP/WARP or SOLVE/RESOLVE. Phases will be refined 

25 further by refining the molecular model against crystallographic, geometric, and 
chemical data using programs such as REFMAC, CNS, or SHELX. 

Drag Design 

The present invention provides methods for identifying new drugs, 
30 antibiotics, inhibitors and modulators of target antibiotic-resistance proteins that 
are characterized by their three-dimensional structures. Such drugs, antibiotics, 
modulators and inhibitors may be competitive, i.e., binding to all or a portion of 
the active site of the target proteins, or non-competitive, Le., binding to and 
inhibiting the target protein whether or not it is bound to another chemical entity. 
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Drugs and new drugs can act directly or indirectly on a target protein or variant 
to prevent or reduce drug resistance caused by that target protein. 

Antibiotics are drugs that have cytostatic or cytotoxic effects on target 
organisms. The key to success for an antibiotic is selectivity for the disease 
5 target, and lack of toxicity to the patient The most useful antibiotics against 
infections are those which attack a microbe-specific target as discussed above. 

One structure-based drug design approach is to probe the target protein 
crystal with molecules composed of a variety of different chemical entities to 
determine optimal sites for interaction between candidate antibiotics or inhibitors 

10 and target proteins. For example, high resolution X-ray diffraction data 

collected from crystals saturated with solvent allows the determination of where 
each type of solvent molecule sticks. Small molecules that bind tightly to those 
sites can then be designed and synthesized and tested for inhibitory activity (J. 
Travis, Science, 262:1374 (1993)). 

1 S This invention also enables the development of compounds that can 

isomerize to short-lived reaction intermediates in the chemical reaction of a 
substrate or other compound that binds to or with a target protein. Thus, the 
time-dependent analysis of structural changes in a target protein during its 
interaction with other molecules is permitted. The reaction intermediates of a 

20 target protein can also be deduced from analysis of the antibiotic or inhibitor in 
co-complex with a target protein. Such information is useful to design improved 
analogues of known antibiotics and target protein inhibitors or to design novel 
classes of antibiotics and inhibitors based on the reaction intermediates observed 
with a target protein inhibitor co-complex. This provides a novel route for 

25 structure-based drug design of antibiotics and inhibitors with both high 
specificity and stability. 

Another structure-based drug design approach made possible by this 
invention is the ability to perform virtual ligand screening to computationally 
screen small molecule data bases for chemical entities or compounds that can 

30 bind in whole, or in part, to a target protein. In this screening, the quality of fit 
of such entities or compounds to the binding site may be judged either by shape 
complementarity or by estimated interaction energy (E. C. Meng et al, J. Comp. 
Chem., 13:505-524 (1992)). 

Because a target protein may crystallize in more than one crystal form, 

35 the structure coordinates of a target protein, or portions thereof, are particularly 
useful to solve the structure of those other crystal forms of target proteins. They 
may also be used to solve die structure of a mutant target protein, or to solve the 
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structure of the crystalline form of any other protein with significant amino acid 
sequence homology to any functional domain of such a target protein. 

' One method that may be employed for this purpose is molecular 
replacement. In this method, the unknown crystal structure, whether it is another 
5 crystal form of a target protein, or a mutant target protein, or an antibiotic-target 
protein co-complex, or a crystal of some other protein with significant amino 
acid sequence homology to any functional domain of a target protein, may be 
determined by using the coordinates of related known target proteins. This 
method will provide an accurate structural form for the unknown crystal more 

10 quickly and efficiently than attempting to determine such information ab initio. 

Thus, known three-dimensional structures permit structure-based drug 
design via screening of known molecules and/or designing of new molecules 
which bind to a target protein structure, particularly at the active site, via the use 
of computerized evaluation systems. For example, computer modeling systems 

15 are available in which the sequence and structure of a target protein (i.e. , the 
bond angles, dihedral angles, distances between atoms in the active site region, 
etc.) may be input. Thus, a machine readable medium may be encoded with data 
representing the coordinates of a target protein or a target protein active site. 
The computer then generates structural details of the site into which a test 

20 compound should bind, thereby enabling the determination of the 
complementary structural details of the test compound. 

The invention therefore provides a computer-assisted method 600 for 
identifying a new drug effective against at least one drug-resistant variant of a 
target protein (see Fig. 6). The method generally involves obtaining the three 

25 dimensional structure of at least one variant site of interaction with die drug by 
using computer processing to generate x, y and z coordinates for each atom in 
the variant site of interaction 602. The three dimensional structure of die target 
protein site of interaction with the drug is also obtained by using computer 
processing to generate an x, y and z coordinates for each atom in the target 

30 protein site of interaction 604. The three dimensional structure of the target 
protein site of interaction may be available and need not be obtained every time 
a new variant is isolated. Another step in the computer-assisted method 600 is 
measuring the differences in distance between the x, y and z coordinates for each 
atom in the variant and target protein sites of interaction 606. Such 

35 measurements permit one of skill in die art to understand how the drug 

interaction site may have changed in response to the drug and how to better 
design a new drug. Hence, die last step in the computer-assisted method 600 is 
designing a new drug to interact with the site of interaction of at least one of the 
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variants by using computer processing to fit a library of potential new drugs into 
the variant and target protein sites of interaction and determining which potential 
new drug fits into the variant site of interaction better than into the target protein 
site of interaction 608. 
5 The invention also provides unique combinations of data fields and data 

structures useful for practicing the invention (see e.g. Fig. 5). One example of 
such a data structure 500 stores information on directed evolution of a series of 
drug resistant variants. This data structure 500 comprises a series of separate 
variant data fields 502, each variant data field comprising each x, y and z atomic 

10 coordinate of each drug resistant variant The data structure 500 also includes a 
target protein data field 504 comprising each x, y and z atomic coordinate of the 
target protein. The data structure 500 can further include a structural difference 
data field 506 comprising die differences in x, y and z atomic coordinates for 
each drug resistant variant relative to the x, y and z atomic coordinates of the 

15 target protein. The data structure 500 can also have a series of drug resistance 
data fields 508, each drug resistance data field comprising each x, y and z atomic 
coordinate of each drug that is resistant to a drug resistance variant. 

The invention also provides an article of manufacture having instructions 
stored on it that cause a computing system (e.g. 702) to identify a new drug 

20 effective against at least one drug-resistant variant of a target protein, the 

instructions comprising: instructions for comparing a series of separate variant 
data fields with a target protein data field to generate a structural difference data 
field comprising the differences in x, y and z atomic coordinates for each drug 
resistant variant relative to the x, y and z atomic coordinates of the target protein, 

25 wherein each variant data field within the series of separate variant data fields 
comprises each x, y and z atomic coordinate of each drug resistant variant, and 
wherein a target protein data field comprises each x, y and z atomic coordinate 
of the target protein; instructions for generating a new drug data field from die 
structural difference data field and a small molecule data field, wherein the small 

30 molecule data field comprises x, y and z atomic coordinates for a series of small 
molecules and wherein the new drug data field comprises small molecules that 
can optimally fit within the site of interaction of die drug-resistant variant New 
drugs identified using the article of manufacture are also provided by the 
invention. 

35 Examples of small molecule inhibitors that were designed based on 

protein three-dimensional structure information are available for HTV protease 
(L. Hong et al., Protein Sci. 9: 1898-1904 (2000); J.V.N.V. Prasad et al. 9 Bioorg. 
Med. Chem. 7:2775-2800 (1999); A. Wlodawere/a/., Annu. Rep. Biophys. 
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Biomol. Struct. 27:249-84 (1998)), renin (J. Rahuel etal., Chem. Biol. 7:493- 
504 (2000)), Factor Xa (S. Maiganan et ai, Curr. Topics Med. Chem. 1:161-74 
(2001)), thrombin (N.Y. Chirgadze et aL 9 Protein Sci. 9:29-36 (2000), B.A. Katz 
et aL 9 J. Mol. Biol. 307: 145 1-86 (2001)), urokinase (V.L. Nienaber et ai, 
5 Structure 8:553-63 (2000)), rhinovirus 3C protease (S.H. Reich et al. 9 J. Med. 
Chem. 43: 1670-1683 (2000)), cathepsin K (R.W. Marquis et aL 9 J. Med. Chem. 
44:725-36(2001)), Abl kinase (T. Schindleref a/., Science 289:1938-42 (2000)), 
CDKs (T.G. Davies et ai, Structure 9:389-97 (2001), P. Furet et al 9 J. Comput 
Aided Mol. Des. 14:403-9 (2000), N.S. Gray et al y Science 281:533-8 (1998), 

10 S.-H. Kim Pure Appl. Chem. 70:555-65 (1998)), EGFR kinase (S. Ghosh et al y 
Curr. Cancer Drug Targets 1 : 129-40 (2001)), Lck kinase (X. Zhu et a/., Structure 
7:651-61 (1999)), Src kinase (T.K. Sawyer et ai y Expert Opin. Invest. Drugs 
10:1327-44 (2001), W.C. Shakespeare et al y Proc. Natl. Acad. Sci. USA 
15:9373-8 (2000), W. Xu et al. 9 Mol. Cell 3:629-36 (1999)), as well as aldose 

15 reductase (Y. Iwata et al., J. Med. Chem. 44:1718-28 (2001)), phospholipase-A2 
(E.D. Mihelich et aL> Biochim. Biophys. Acta 1441 :223-8 (1999)), STAT (J. 
Hurkson et al y Oncogene 19:6594-9 (2000)), Bcl-2 (Z. Huang Oncogene 
19:6627-31 (2000)), neuraminidase (Y.S. Babu et al y J. Med. Chem. 43:3482-6 
(2000)), and glyceraldehyde-3-phosphate dehydrogenase (J.C. Bressi et al. 9 J. 

20 Med. Chem. 44:2080-93 (2001)). 

More particularly, the design of drugs and compounds that bind to or 
inhibit a target protein according to this invention generally involves 
consideration of two factors. First, die drug or compound must be capable of 
physically and structurally associating with a target protein. Non-covalent 

25 molecular interactions important in die association of a target protein with 

antibiotics and inhibitors include hydrogen bonding, van der Waals, hydrophobic 
and/or covalent interactions. Second, the drug or compound must be able to 
assume a conformation that allows it to associate with a target protein. Although 
certain portions of die drug or compound will not directly participate in this 

30 association with a target protein, those portions may still influence the overall 
conformation of the molecule. This, in turn, may have a significant impact on 
potency. Such conformational requirements include die overall three- 
dimensional structure and orientation of die chemical entity or compound in 
relation to all or a portion of the binding site, e.g., active site or accessory 

35 binding site of a target protein, or the spacing between functional groups of a 
compound comprising several chemical entities that directly interact with a 
target protein. 
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The potential inhibitory or binding effect of a drug or compound on a 
target protein may be analyzed prior to its actual synthesis and testing by the use 
of computer modeling techniques. If the theoretical structure of the given drag or 
compound suggests insufficient interaction and association between it and a 
5 target protein, synthesis and testing of the compound is obviated. However, if 
computer modeling indicates a strong interaction, the molecule may then be 
synthesized and tested for its ability to bind to a target protein and inhibit using a 
suitable assay. In this manner, synthesis of inoperative compounds may be 
avoided. 

1 0 Compounds and drugs capable of binding to a target protein of the 

present invention may be computationally evaluated and designed by means of a 
series of steps in which chemical entities or fragments are screened and selected 
for their ability to associate with the individual binding pockets or other areas of 
the target protein, in the mutant and wild-type forms of the protein. Once a 

1 5 three-dimensional structure of a biological macromolecule (target) has been 
determined by X-ray, NMR, or modeling, one can then perform structure-based 
drug design to computationally fit millions of small molecule compounds into 
complimentary regions of the target These regions of the target structure can be 
active sites, allosteric sites, regulatory sites, global suppressor sites, partner 

20 binding sites, etc. 

One skilled in the art may use one of several structure-based drug design 
methods to computationally screen chemical entities or fragments for their 
ability to associate with a target protein and more particularly with the individual 
binding pockets of the target protein active site or accessory binding site. This 

25 process may begin by visual inspection of, for example, the active site on the 
computer screen based on available target protein coordinates. Selected 
fragments or chemical entities may then be positioned in a variety of 
orientations, or docked, within a binding pocket of a target protein, in the mutant 
and wild-type forms of the protein. Docking may be accomplished using 

30 software such as Quanta and Sybyl, followed by energy minimization and 

molecular dynamics calculations with standard molecular mechanics force fields, 
such as CHARMM and AMBER. 

Specialized computer programs may also assist in the structure-based 
drug design process of selecting fragments or chemical entities. These include: 

35 GRID [P. J. Goodford, "A Computational Procedure for 

Determining Energetically Favorable Binding Sites on Biologically 
Important Macromolecules", J. Med. Chem., 28:849-857 (1985)]. GRID 
is available from Oxford University, Oxford, UK. 
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MCSS [A. Miranker and M. Kaiplus, "Functionality Maps of 
Binding Sites: A Multiple Copy Simultaneous Search Method", Proteins: 
Structure, Function and Genetics, 1 1 :29-34 (1991)]. MCSS is available 
from Molecular Simulations, Burlington, Mass. 
5 AUTODOCK [D. S. Goodsell and A. J. Olsen, "Automated 

Docking of Substrates to Proteins by Simulated Annealing", Proteins: 
Structure, Function, and Genetics, 8:195-202 (1990)]. AUTODOCK is 
available from Scripps Research Institute, La Jolla, Calif. 

DOCK [I. D. Kuntz et al, "A Geometric Approach to 
10 Macromolecule-Ligand Interactions", J. Mol. BioL, 161:269-288 (1982)]. 

DOCK is available from University of California, San Francisco, Calif. 
Once suitable chemical entities or fragments have been selected, they can 
be assembled into a single antibiotic or inhibitor compound. Visual inspection of 
the relationship of the fragments to each other on die three-dimensional image 
15 displayed on a computer screen in relation to the structure coordinates of the 
target proteins is followed by manual model building using software such as 
Quanta or Sybyl. 

Useful programs to aid one of skill in the art in connecting the individual 
chemical entities or fragments include: 

20 CAVEAT [P. A. Bartlett et al, "CAVEAT: A Program to 

Facilitate the Structure-Derived Design of Biologically Active 
Molecules", in Molecular Recognition in Chemical and Biological 
Problems", Special Pub., Royal Chem. Soc. 78, pp. 182-196 (1989)]. 
CAVEAT is available from the University of California, Berkeley, Calif. 

25 3D Database systems such as MACCS-3D (MDL Information 

Systems, San Leandro, Calif.). This area is reviewed in Y. C. Martin, 
"3D Database Searching in Drag Design", J. Med. Chem., 35:2145-2154 
(1992). 

HOOK (available from Molecular Simulations, Burlington, 

30 Mass.). 

Instead of proceeding to build a target protein inhibitor or antibiotic in a 
step-wise fashion, building up from one fragment or chemical entity at a time as 
described above, inhibitory or antibiotic compounds may be designed as a whole 
or "de novo" using either an empty active site or optionally including some 
35 portion(s) of a known inhibitors). These methods include: 

LUDI [H.-J. Bohm, "The Computer Program LUDI: A New 
Method for the De Novo Design of Enzyme Inhibitors", J. Comp. Aid. 
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Molec. Design, 6:61-78 (1992)]. LUDI is available from Biosym 
Technologies, San Diego, Calif. 

LEGEND [Y. Nishibata and A. Itai, Tetrahedron, 47:8985 

(1991)]. 

5 LEGEND is available from Molecular Simulations, Burlington, Mass. 

LEAPFROG (available from Tripos Associates, St Louis, Mo.). 
Other molecular modeling techniques may also be employed in 
accordance with this invention. See, e.g., N. C. Cohen et al, "Molecular 
Modeling Software and Methods for Medicinal Chemistry", J. Med. Chem., 

10 33:883-894 (1990). See also, M. A. Navia and M. A. Murcko, "The Use of 

Structural Information in Drug Design", Current Opinions in Structural Biology, 
2:202-210 (1992). For example, where the structures of test compounds are 
known, a model of the test compound may be superimposed over the model of 
the structure of die invention. Numerous methods and techniques are known in 

15 the art for performing this step, any of which may be used. See, e.g. , P. S. 

Farmer, Drag Design, Aliens, E. J., ed., Vol. 10, pp 1 19-143 (Academic Press, 
New York, 1980); U.S. Pat No. 5,331,573; U.S. Pat No. 5,500,807; C. 
Veriinde, Curr. Biol., 2:577-587 (1994); and I. D. Kuntz, Science, 257:1078- 
1082 (1992). The model building techniques and computer evaluation systems 

20 described herein are not a limitation on the present invention. 

Thus, using these computer evaluation systems, a large number of 
compounds may be quickly and easily examined, and expensive and lengthy 
testing avoided. Moreover, the need for actual synthesis of many compounds is 
effectively eliminated. 

25 Another embodiment of die present invention involves the use of high- 

throughput ligand screening methods to identify small molecules capable of 
binding or interacting with a target Once a mutant or wild type target protein is 
isolated (purified), or expressed in a cell line that is coupled to any readout, 
hundred to thousands (to millions for ultra high-throughput) of small molecules 

30 can be screened against the target molecule for detecting a change in a 

biophysical property of the target or the readout molecule. Readouts include 
radioactivity, fluorescence, colorimetric, etc. 

Once identified by the modeling or screening techniques, the antibiotic or 
inhibitor or antibiotic-resistance inhibitor may be tested for bioactivity using 

35 standard techniques. For example, test antibiotics and inhibitors can be tested in 
binding assays using conventional formats to screen for activity or binding. One 
particularly suitable assay format includes the enzyme-linked immunosorbent 
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assay (ELISA). Other assay formats may be used; these assay formats are not a 

limitation on the present invention. 

In another aspect of the present invention, target protein structures permit 

the design and identification of synthetic compounds and/or other molecules 

5 which are characterized by the conformation of the target proteins. Using known 

computer systems, the coordinates of die structures of the invention may be 

provided in machine readable form, the test antibiotics and inhibitors designed 

and/or screened, and their conformations superimposed on the structures of die 

target proteins of the invention. Subsequently, suitable candidates identified as 

10 above may be screened for the desired inhibitory activity, stability, and the like. 

An additional embodiment of the present invention involves the use of 

Structure Activity Relationships to develop, design, or refine candidate small 

molecule inhibitors or antibiotics. Using a variety of biophysical techniques, 

such as screening, CD, NMR, X-ray, modelling, Mass spec, etc..., structure 

1 5 activity relationships can be determined for any target molecule against a variety 
of small molecule inhibitors (Tollenaere, JJ\ Pharm. World Sci. 18(2): 56-62 
(1996) ). In drag design, structure-activity relationships (SAR) use the 
relationship between chemical structure and pharmacological activity for a series 
of compounds to derive and design improved drug species. Another algorithm 

20 used is Quantitative Structure- Activity Relationships (QSAR), which are 
mathematical relationships linking chemical structure and pharmacological 
activity in a quantitative manner for a series of compounds. Methods which can 
be used in QSAR include various regression and pattern recognition techniques. 
In general, structure-property correlations (SPC) refer to all statistical 

25 mathematical methods that are used to correlate any structural property to any 
other property (intrinsic, chemical or biological), using statistical regression and 
pattern recognition techniques. Finally, three-dimensional quantitative structure- 
activity relationships (3D-QSAR) are die analysis of the quantitative relationship 
between the biological activity of a set of compounds and their spatial properties 

30 using statistical methods, (all definitions from 

http://www.chem.qmw.ac.uk/iupac/medch«n/ix.htm^s5). At 
http://www.pharmacopeia, SAR is defined as an analysis which defines the 
relationship between the structure of a molecule and its ability to affect a 
biological system (be that a cell, protein, enzyme, or organism). 
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The invention also provides a system 700 that includes a computer 
system 702; memory 704 accessible to the computer system 702, a series of data 
structures 706 stored in the memory and one or more software components 
operable on the computer system. A computer system 702 is any type of device 
5 having at least a processor and accessible memory 704 and associated networks 
and peripherals, such as local area networks (LANS) and printers. See e.g., Fig. 
7. Examples of computers include personal computers, laptops, supercomputers, 
and any other kind of computer. The computer system 702 is capable of 
operating according to instructions, software components, and data structures. 

10 In particular, the invention provides a system 700 comprising a computer 

system 702; memory 704 accessible to the computer system 702; a first data 
structure 706 stored in the memory 704 comprising three dimensional structural 
data of a target protein; a second data structure 706 stored in the memory 704 
comprising a first population of three dimensional structural data for a series of 

1 5 variants of die target protein, wherein the variants were generated by one or 

more rounds of directed evolution of the target protein in the presence of a drug; 
a third data structure 706 stored in the memory 704 comprising a second 
population of three dimensional structural data for a series of second generation 
variants of a first variant, wherein the second generation of variants is generated 

20 by one or more rounds of directed evolution of the first variant in the presence of 
a drug; a fourth data structure 706 comprising three dimensional structural data 
for a population of potential new drugs comprising antibiotics, enzyme 
inhibitors, protein inhibitors, antibiotic resistance inhibitors, herbicide resistance 
inhibitors, insect repellants, insecticide resistance inhibitors, viral drug resistance 

25 inhibitors, chemotherapeutic resistance inhibitors, modulators, antagonists, 
agonists, effectors, ligands, antibodies, antibody fragments, peptides or small 
molecules; a first software component 706 operable on the computer system 702 
to compare the first data structure with the second or third data structure and to 
identify three dimensional structural variants that represent a drug resistance 

30 evolution profile of said target protein; a second software component 706 

operable on the computer system 702 to compare the fourth data structure 706 
with die drug resistance evolution profile of said target protein and thereby to 
design a new drug effective against at least one drug resistance variant 
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Once identified and screened for biological activity, these new drugs, 
antibiotics, inhibitors and modulators may be used therapeutically or 
prophylactically to block target protein activity, and thus, cellular or viral 
viability. 

5 

EXAMPLES 

The following examples are provided to describe and illustrate die 
present invention. As such, they should not be construed to limit the scope of 
die invention. Those in the art will well appreciate that many other 
10 embodiments also fall within the scope of the invention, as it is described 
hereinabove and in the claims. 

Example 1: Beta-Lactamase (TEM52) Penicillins 

According to the present invention, antibiotic resistance involves the 
1 5 accumulation of mutations beneficial to the protein while maintaining residue 
interactions and core packings that are critical for preserving function. The 
constraints of maintaining stability, while increasing activity, drastically reduces 
the number of possible mutational combination pathways. To test this theory, the 
evolution of TEM-1 p-lactamase was directed using a hypermutator£ coli- 
20 based directed evolution technique with cefotaxime selection. The selected 
mutants were compared to two previous directed evolution studies and a 
database of clinical isolates. In all cases, evolution resulted in the generation of 
the E104K/M182T/G238S combination of mutations with 500-fold increased 
resistance, which is equivalent to clinical isolate TEM-52. The structure of 
25 TEM-52 was determined to 2.4 A. The G238S mutation widens access to the p- 
lactamase active site by 2.8 A whereas the E104K mutation stabilizes the 
reorganized topology of p-lactamase. The Ml 82T mutation is located 17 A from 
the active site and appears to be a global suppressor mutation that acts to 
stabilize the new enzyme structure. 

30 

Methods 

Mutator strain directed evolution. Each cycle of in vivo DNA 
shuffling of ^lactamase consisted of four steps: (i) infection, mutagenesis, and 
packaging of a TEM-1 P-lactamase-encoding phagemid, pBAD18 (ATCC), in 
35 the hypermutator strain MG 163 (generated by phage PI transduction of mutD5 
(Echols et a/., 80 Proa Nat'l Acad. Sci. 2189-92 (1983); Low et al y 260 J. Mol. 
Biol. 359-68 (1996)) from donor strain DM2516 (a gift of R.A. Kelln) into the 
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F* recA + strain NM522 via the linked transposon zaf-13::Tnl0); (ii) 
superinfection of the shuffler strain MG 109 (generated by transduction of strain 
NM522 with a PI lysate grown on a mutS::Tn5 donor (a gift of D. Thayler)) 
with die mutagenized phagemid pool for recombination; (iii) selection on 
5 increasing levels of cefotaxime (Sigma); and (iv) pooling resistant colonies and 
rescuing phagemid for the next round of shuffling. Substitution of wild-type 
strain NM522 in the in vivo DNA shuffling protocol was used as a non-mutator 
control. 

Protein production, crystallization and structure determination. The 

1 0 TEM-52 gene was PCR amplified (primers: 5- 

ATGAGTATTCAACATATGTTCCTGGTCGCC-3* (SEQ ID NO:l) and 5*- 
TTACCAATGCTTGGATCCAATCAGTGAGGC-3' (SEQ ID NO:2)), cloned 
into pET3 0a with Ndel and BamHL, and transformed into BL2 1 (DE3) pLysS . 
The DNA sequence of the TEM-52 open reading frame is as follows (SEQ ID 

15 NO:3): 

ATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCl'1'l'I'l'lGCGG 
CATTTTGCCITCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAG 
TAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATC 

20 GAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCC 
CGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATG 
TGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCG 
GTCGCCGCATACACTATTCTCAGAATGACTTGGTTAAGTACTCAC 
CAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAA 

25 TTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAAC 
TTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTT 
TTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAA 
CCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCAC 
GACGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGG 

30 CGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGA 
TGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTC 
CGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCAGTGAGCGTG 
GGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCT 
CCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGG 

35 ATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTA 
AGCATTGGTAA 
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The TEM-52 protein was purified according to the method described in 
Jelsch et al, 16 Proteins 364-83 (1993). The sequence of the TEM-52 protein is 
as follows (SEQ ID NO:4). 

5 MSIQHFRVALIPFFAAFCLPWAH^^ 

DIJ4SGKILESFRPEERFPMMSTFKVLLCGAVI^RVDAGQEQLGRRH 
YSQNDLVKYSPVTEKHLTDGMTV^ 
GGPKELTAFUINMGDHVTRLDRWEPEI24EAIP^^ 
TITJUOXTGEIXTIJ^SR^L^ 
10 KSGASfiRGSRGHAAIXjPIXjKPSRIVVIYTTGSQATMDERNRQ 
ASLDCHW 

The TEM-52 protein was crystallized by hanging drop method (14 mg/ ml) over 
16% PEG 8K,125 mM MgOAc, 25 mM Na Citrate, pH 6.5. Crystals appeared 
15 after 3 days; space group P4 3 2j2 (a = b = 88.4 A, c = 500.4 A), six molecules per 
asymmetric unit. 

Data were collected to 2.4 A on three frozen crystals at the Advanced 
light Source beamline 5.0.2, and processed with DENZO and SCALEPACK 
(Otwinowski et al. y 276 Methods Enzymol. 307-26 (1997)). Because of the long 

20 c-axis, several attempts were made on three crystals in different orientations to 
collect a missing cone of data and completeness up to 90% was obtained. In 
addition, attempts were made to process the data in a hexagonal or rfaombohedral 
lattice since a hexamer was observed in the tetragonal space group. However, all 
attempts at processing in the higher symmetry were unsuccessful. Molecular 

25 replacement with ITEM as the model (Jelsch et al. 9 16 Proteins 364-83 (1993)) 
using the program AMORE (Navaza 50 Acta Crystallogr. A157-63 (1994)) 
located five of the six molecules. An F D - F c map revealed density for a sixth 
molecule. All six molecules were rebuilt using 0(Jones et al. y 47 Acta 
Crystallogr. A 1 10-19 (1991)) and refined using CNS (Brunger et ai y 54 Acta. 

30 Crystallogr. D 905-2 1 (1 998)) until convergence at 2.4 A with a Roy* = 21 .7 and 
= 26. 1 . Five of the six molecules adhere to non-crystallographic symmetry, 
with the sixth molecule being very similar in structure, but not able to refine with 
NCS due to packing differences. The coordinates have been deposited in the 
Protein Data Bank (accession code 1HTZ); which are herein incorporated by 

35 reference in their entirety. 

Table 2 Data Collection and Refinement Statistics 
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Data Collection 





Resolution (A; 


o AO ATI 




Molecules per asymmetric unit 


O 




Number of reflections 


/30,VUO 


c 


.Number oi unique reuecuons 


71 71 1 




Redundancy 1 


10(0) 




I/a 1 


22.4 (7.1) 






11.5 (43.2) 




Completeness (%)' 




10 


Refinement (for all six molecules) 






Resolution (A) 


20— 2.4U 




Reflections 


66,385 




Wilson B-factor (A 2 ) 
Rc^(%) 3 /R to (%) 4 


43.5 




21.7/26.1 


15 


Protein Atoms 


10,140 




Number of waters 


587 




R.m.s deviations 






Bond lengths (A) 


0.006 




Bond angles (°) 


1.3 


20 


Average r.m.s.deviation 5 






Residues 100-110 


0.49 




Residues 237-244 


0.35 



25 1 Numbers in parenthesis denote values for highest resolution bin 

(2.46-2.40 A). 

2 FIX EQUATIONRsymm = E| I hi - <Ih>| /S for all h, where I h 
is the intensity of reflection h. 

3 FIX EQUATIONRoyst = S| | F 0 |- |F| |/E| | F 0 | . 

30 4 Rftce was calculated with 10% of the reflections not included in 

refinement All reflections were used in the final round of 
refinement 

5 Noncrystallographic symmetry (NCS) was applied to five of the 
six molecules. Restraints were left off the sixth molecule because 
35 of the differences caused by crystal packing. In the final round of 

building and refinement NCS was removed for residues 100-1 10 
and 237-244. 

TEM-52 Structure 

40 To understand the role of the mutations El 04K/M1 82T/G238S in natural 

and directed evolution, the crystal structure of TEM-52 was determined to 2.4 A 
(Figs. 3, 4 and 5). Fig. 3 provides alternative views of the TEM-52 three- 
dimensional structure. Fig. 3a illustrates that both nature and directed evolution 
experiments identified the E104K/M182T/G238S combination of mutations 

45 (green/red/green) and DNA shuffling predicted the loop mutations at positions 
42, 92 and 241 (red). Catalytic residue positions are black and the B3 p-strand is 
red. Fig. 3b provides an overlay stereo view of wild type TEM-1 (cyan) and 
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TEM-52 (blue) illustrating movement of loops 238-243, 267-271 and 40-43. 
The G238S mutation (green) causes the Glu 240 side chain conformation to 
change (compare blue and purple side chains), which widens the active site by 
2.8 A, potentially allowing binding of bulky cephalosporins (catalytic Ser 70 
5 side chain is shown in yellow). Fig. 3c provides a stereoview of the electron 
density for the TEM-52 active site. 

The active site is located at the interface of the two p-lactamase domains. 
One domain consists of a five-stranded nScoIIsheet packed against three a- 
helices, while the second domain consists of eight a-helices. The overall TEM- 

1 0 52 structure is very similar to wild type p-lactamase (TEM-1) with a root mean 
square (r.m.s.) deviation of 0.66 A comparing the CALPHA traces. The 
mutations generated in early rounds of selection (at positions 104 and 238) are 
involved in reorganizing the shape of the active site (Fig. 3b) while a mutation 
generated in later stages of the selection (position 1 82) may compensate for die 

1 5 destabilizing effects introduced by the initial mutations. 

An individual G238S mutation results in an eight-fold increase in 
minimum inhibitory concentration (MIC) (see Stemmer, 370 Nature 389-91 
(1994)), but the single mutant was found to be thermally destabilized (see 
Raquet et al y 23 Proteins 63-72 (1995)). In TEM-52, Ser 238 forms two new 

20 hydrogen bonds from the hydroxyl group: one to the Ser 243 backbone amide 
(2.9 A, average over six molecules) and one to the Ser 243 hydroxyl group (3.2 
A) (Fig. 3 b,c). Loop 238-243 that contains this mutation is altered by as much 
as 2.8 A (measured by die CALPHA positions of Glu 240 for TEM-52 versus 
TEM-1), widening die opening to the active site and potentially facilitating 

25 binding of bulky cephalosporin antibiotics. Movement of loop 238-243 results 
in the reorganization of adjacent loops 267-271 and 40-43. In addition, loop 
238-243 repositioning causes the backbone carbonyl of Glu 240, which was 
hydrogen bonded with the Ser 243 backbone amide in TEM- 1 , to rotate 1 55° in 
TEM-52, forming a hydrogen bond with the Gly 242 backbone amide group. In 

30 TEM-1 , flie Glu 240 carboxylate group was thought to hinder binding of some 
cephalosporins and P-lactamase inhibitors by steric repulsion (see Medeiros, 24 
Clin. Infect Dis. (Suppl. 1), S19-45 (1997)). However, in the TEM-52 structure 
this side chain is repositioned out of the active site and oriented over the B3 
TtZcoII-strand, reducing die possibility of any interference with substrate or 

35 inhibitor binding. 

The second TEM-52 mutation, E104K, when observed as a single 
mutation in die wild type enzyme background, is found to increase the MIC for 
cefotaxime four-fold (see Stemmer, 370 Nature 389-91 (1994); Raquet et al. 3 23 
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Proteins 63-72 (1995)). This mutation acts synergistically with the G238S 
mutation, with the double mutant displaying a 267-fold increase in MIC versus 
TEM-1 (see Stemmer, 370 Nature 389-91 (1994)). In the crystal structure of 
TEM-52, for the six molecules that are located in die asymmetric unit, all six 
5 Lys 104 residues have well-defined electron density (Fig. 3c). This is 
particularly striking since Lys 104 is located near the surface, and surface- 
exposed lysine residues are typically disordered. In TEM-52, Lys 104 from the 
E104K mutation is oriented into the active site toward the reorganized Glu 240 
side chain. In the wild type structure, the Glu 104 and Glu 240 carboxyl groups 
10 were separated by 6.8 A (Vanhove et ai 9 54 Cell. Mol. Life Sci. 372-77 (1998)). 
However, in TEM-52, the E104K and Ghi 240 side chains are only 4.8 A apart 
(Fig. 3c). 



M182T is a global suppressor mutation 

1 5 Perhaps the most interesting mutation found in TEM-52 is Ml 82T, 

which has the ability to reverse the destabilizing effects of antibiotic- or 
inhibitor-resistant mutations (see Huang et al., 94 Proc. Nat'l. Acad. Sci. 8801- 
06 (1997); Farzaneh et aL 9 40 Antimicrob. Agents Chemother. 2434-36 (1996)). 
M182T is the only mutation found in both antibiotic-resistant and inhibitor- 

20 resistant p-lactamases, suggesting a potential role as a global suppressor. The 
Ml 82T mutation in combination with the E104K/G238S mutations increases the 
cefotaxime MIC to 500 jig/ml. 

Fig. 4 depicts the structure of the region around the global suppressor 
mutation M182T found in TEM-52. Fig. 4a provides the structure of the P- 

25 lactamase Ml 82T mutant (light purple), and identifies the location of this 

mutation as being in a hinge region (arrow) between two domains and 17 A from 
the active site. Thr 182 (light purple) in one domain (blue) forms new 
interactions with residues 63 (green) and 64 (purple) in the second domain 
(yellow). Fig. 4b provides an alternative view of TEM-52, illustrating the 

30 interactions relevant to the Thr 1 82 (light purple) mutation. Ml 82T 

compensates for the destabilizing mutations found at position 69 (inhibitor- 
resistant mutants) and positions 164 or 238 (antibiotic-resistant mutations) 
(yellow), potentially contributing to stabilization of the reorganized shape of the 
active site (catalytic residues shown in red). Small molecules designed to 

35 destabilize the region around position 1 82 may lead to a novel class of 
antibiotics or inhibitors. 

Amino acids leading from die region around position 182 to the catalytic 
triad do not generally tolerate substitutions, and are thought to be necessary for 
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proper core packing and catalytic residue orientation (Huang et a/., 258 J. MoL 
Biol. 688-703 (1996)). Phe 66 (a highly conserved residue in Class A p- 
lactamases) and Pro 67 are involved in hydrophobic core packing interactions; 
Thr 71 and Lys 73 are important for proper positioning of the catalytic residues 
5 Ser 70 and Asn 132, while Asp 176, Asp 179, Thr 180 and Thr 181 interact with 
the active site CAPITAL OMEGA-loop that contains the catalytic residue Glu 
166. The hydroxyl group of Thrl82 forms two new hydrogen bonds with the 
backbone carbonyls of residues Glu 63 (2.9 A) and Glu 64 (3.3 A), increasing 
the interaction between the two domains and possibly reducing the 

10 conformational flexibility of the hinge region between the two domains. The 
global suppressor Ml 82T has been previously proposed to reverse the 
destabilizing effects of antibiotic-resistant mutations at positions 238 (Poyart et 
al. y 42 Antimicrob. Agents Chemother. 108-13 (1998)) and possibly position 164 
(Yang, 42 Antimicrob. Agents Chemother. 1671-76 (1998)). M182T may also 

1 5 help to stabilize the inhibitor-resistant mutation at position 69 (Medeiros, 24 
Clin. Infect Dis. (Suppl. 1) S19-45 (1997); Huang et al., 94 Proc. Nat'l. Acad. 
Sci. USA 8801-06 (1997)). 

Global suppressor mutations have been described for phage P22 coat 
protein (Aramli et al, 274 J. Biol. Chem. 22217-24 (1999), several forms of p53 

20 (Nikolova et al., 19 EMBO J. 370-78 (2000)), P-lactamase (Huang et al, 94 
Proc. Nat'l. Acad. Sci. USA 8801-06 (1997)), and have been presumed for HIV 
protease (id.). By a mechanism similar to that proposed to restore function to 
tumorigenic p53 variants (Nikolova et al, 19 EMBO J. 370-78 (2000)), a small 
molecule targeted to a global suppressor site may act to either stabilize or 

25 destabilize the native fold. Isomerization of the Glu 1 66-Pro 1 67 peptide bond to 
a cis conformation in ^-lactamase is thought to occur in the last phase of the five 
phases in die proposed protein folding pathway (Vanhove et al., 54 Cell. Mol. 
life Sci. 372-77 (1998)). This isomerization step is most likely the rate-limiting 
step (time constant of 300 s) in P-lactamase folding, resulting in collapse of the 

30 ft-loop residues onto the main body of die enzyme (V anhove et al , 54 Cell. 
MoL Life Sci. 372-77 (1998)). Without isomerization of the Pro 167 bond, the 
H-loop is prevented from laying down on top of the hinge region, making the 
M182T global suppressor region more exposed. Until the Q-loop assumes the 
correct position, the catalytic residues are improperly positioned, making p- 

35 lactamase inactive. A small molecule targeted to the global suppressor region 
could possibly bind to die inactive ^-lactamase intermediate, preventing Q-loop 
collapse and the proper assembly of the catalytic residue geometry. As 
described above, mutations in this region of P-lactamase are not generally 
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tolerated, and thus, development of resistance by mutations elicited to a bound 
molecule at this location would be less likely to occur. 

Structural consequences of further mutations to TEM-52 
5 Several combinations of mutations have not been observed in the clinic, 

but are likely to appear if increased levels of cefotaxime are continually used. 
For example, mutation E240K was identified by the DN A analogs study 
(Zaccolo et al. y 285 J. Mol. Biol. 775-83 (1999)) and has been observed in 
thirteen naturally occurring isolates (Fig. 1). A lysine at position 240 of TEM- 

10 52 would most likely allow this side chain to interact directly with antibiotic 
substrates (Matagne et al. y 330 Biochem. J. 581-98 (1998)). Additionally, the 
G238S mutation in TEM-52 causes a minor reorganization of loops 267-271 and 
40-43. Consistent with this movement, the DNA shuffling experiment was 
observed to evolve an A42G mutation. A different substitution at this position, 

1 5 A42V, has been observed (Jacoby et al 9 ht^p://www.lahey.oig/studies/webthtm 
(2000)) and may fill the space produced by adjacent loop 267-271 movement, 
potentially providing further stabilization of the enzyme (Fig. 3b). Finally, the 
DNA shuffling experiment identified an A18V ({-lactamase mutation, located in 
die signal sequence region, providing for an alternative evolutionary mechanism 

20 to increase antibiotic resistance without the requirement of any direct changes in 
catalytic active site structure. 

Drug design based on the TEM-52 ^-lactamase mutant 

Lead small molecule candidates were selected based on docking with the 
25 TEM-52 (PDB code 1HTZ) or TEM-1 (PDB code 1 TEM) active sites, using the 
virtual ligand screening (VLS) technology (ICM Main and ICM-Dock) available 
from MolSoft, LLC. A library of over 200,000 small molecule compounds were 
computationally screened using ICM-Dock, and a number of lead inhibitors were 
ligand screened and identified that interact with either TEM52 or TEM1 or both. 

30 

EXAMPLE 2: Drug Design Procedures 

The present invention provides methods for identifying new drugs such 

as antibiotics, inhibitors and modulators of target antibiotic-resistance proteins 
that are characterized by their three-dimensional structures. Such antibiotics and 
35 inhibitors may be competitive, i.e. , binding to all or a portion of die active site of 
the target proteins, or non-competitive, binding to and inhibiting the target 
protein whether or not it is bound to another chemical entity. 
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One structure-based drug design approach is to probe the target protein 
crystal with molecules composed of a variety of different chemical entities to 
determine optimal sites for interaction between candidate antibiotics or inhibitors 
and target proteins. For example, high resolution X-ray diffraction data 

5 collected from crystals saturated with solvent allows the determination of where 
each type of solvent molecule sticks. Small molecules that bind tightly to those 
sites can then be designed and synthesized and tested for inhibitory activity (J. 
Travis, Science, 262:1374 (1993)). 

This invention also enables die development of compounds that can 

1 0 isomerize to short-lived reaction intermediates in the chemical reaction of a 
substrate or other compound that binds to or with a target protein. Thus, Ae 
time-dependent analysis of structural changes in a target protein during its 
interaction with other molecules is permitted. The reaction intermediates of a 
target protein can also be deduced from analysis of the antibiotic or inhibitor in 

1 5 co-complex with a target protein. Such information is useful to design improved 
analogues of known antibiotics and target protein inhibitors or to design novel 
classes of antibiotics and inhibitors based on the reaction intermediates observed 
with a target protein inhibitor co-complex. This provides a novel route for 
structure-based drug design of antibiotics and inhibitors with both high 

20 specificity and stability. 

Another structure-based drug design approach made possible by this 
invention is die ability to perform virtual ligand screening to computationally 
screen small molecule data bases for chemical entities or compounds that can 
bind in whole, or in part, to a target protein. In this screening, the quality of fit 

25 of such entities or compounds to die Once a three-dimensional structure of a 
biological macromolecule (target) has been determined by X-ray, NMR, or 
modeling, one can then perform structure-based drug design to computationally 
fit millions of small molecule compounds into complimentary regions of the 
target These regions of the target structure can be active sites, allosteric sites, 

30 regulatory sites, global suppressor sites, partner binding sites, etc. 

One skilled in the art may use one of several structure-based drug design 
methods to computationally screen chemical entities or fragments for their 
ability to associate with a target protein and more particularly with the individual 
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binding pockets of the target protein active site or accessory binding site. This 
process may begin by visual inspection of, for example, the active site on the 
computer screen based on available target protein coordinates. Selected 
fragments or chemical entities may then be positioned in a variety of 
5 orientations, or docked, within a binding pocket of a target protein, in the mutant 
and wild-type forms of the protein. Docking may be accomplished using 
software such as Quanta and Sybyl, followed by energy minimization and 
molecular dynamics calculations with standard molecular mechanics force fields, 
such as CHARMM and AMBER. 
1 0 Specialized computer programs may also assist in the structure-based 

drug design process of selecting fragments or chemical entities. These include: 
GRID [P. J. Goodford, "A Computational Procedure for 
Determining Energetically Favorable Binding Sites on Biologically 
Important Macromolecules", J. Med. Chem., 28:849-857 (1985)]. GRID 
1 5 is available from Oxford University, Oxford, UK. 

MCSS [A. Miranker and M. Kaiplus, "Functionality Maps of 
Binding Sites: A Multiple Copy Simultaneous Search Method", Proteins: 
Stiucture, Function and Genetics, 11:29-34 (1991)]. MCSS is available 
from Molecular Simulations, Burlington, Mass. 
20 AUTODOCK [D. S. Goodsell and A. J. Olsen, "Automated 

Docking of Substrates to Proteins by Simulated Annealing", Proteins: 
Structure, Function, and Genetics, 8: 195-202 (1990)]. AUTODOCK is 
available from Scripps Research Institute, La Jolla, Calif. 

DOCK [I. D. Kuntz et al, "A Geometric Approach to 
25 Macromolecule-Iigand Interactions", J. Mol. Biol., 161:269-288 (1982)]. 

DOCK is available from University of California, San Francisco, Calif. 
Once suitable chemical entities or fragments have been selected, they can 
be assembled into a single antibiotic or inhibitor compound. Visual inspection of 
the relationship of the fragments to each other on the three-dimensional image 
30 displayed on a computer screen in relation to the structure coordinates of die 
target proteins is followed by manual model building using software such as 
Quanta or Sybyl. 
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Useful programs to aid one of skill in the art in connecting the individual 
chemical entities or fragments include: 

CAVEAT [P. A. Bartlett et al, "CAVEAT: A Program to 
Facilitate the Structure-Derived Design of Biologically Active 
5 Molecules", in Molecular Recognition in Chemical and Biological 

Problems", Special Pub., Royal Chem. Soc. 78, pp. 182-196 (1989)]. 
CAVEAT is available from the University of California, Berkeley, Calif. 

3D Database systems such as MACCS-3D (MDL Information 
Systems, San Leandro, Calif.). This area is reviewed in Y. C. Martin, 
10 "3D Database Searching in Drug Design", J. Med. Chan., 35:2145-2154 

(1992). 

HOOK (available from Molecular Simulations, Burlington, 

Mass.). 

Instead of proceeding to build a target protein inhibitor or antibiotic in a 
1 5 step-wise fashion, building up from one fragment or chemical entity at a time as 
described above, inhibitory or antibiotic compounds may be designed as a whole 
or "de novo" using either an empty active site or optionally including some 
portion(s) of a known inhibitors). These methods include: 

LUDI [H.-J. Bohm, "The Computer Program LUDI: A New 
20 Method for the De Novo Design of Enzyme Inhibitors", J. Comp. Aid. 

Molec. Design, 6:61-78 (1992)]. LUDI is available from Biosym 
Technologies, San Diego, Calif. 

LEGEND [Y. Nishibata and A. Itai, Tetrahedron, 47:8985 

(1991)]. 

25 LEGEND is available from Molecular Simulations, Burlington, Mass. 

LEAPFROG (available from Tripos Associates, St Louis, Mo.). 
Other molecular modeling techniques may also be employed in 
accordance with this invention. See, e.g., N. C. Cohen et al, "Molecular 
Modeling Software and Methods for Medicinal Chemistry", J. Med. Chem., 
30 33:883-894 (1990). See also, M. A. Navia and M. A. Murcko, "The Use of 

Structural Information in Drug Design", Current Opinions in Structural Biology, 
2:202-210 (1992). For example, where the structures of test compounds are 
known, a model of the test compound may be superimposed over the model of 
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the structure of the invention. Numerous methods and techniques are known in 
the art for performing this step, any of which may be used. See, e.g., P. S. 
Farmer, Drug Design, Ariens, E. J., ed., Vol. 10, pp 1 19-143 (Academic Press, 
New York, 1980); U.S. Pat No. 5,331,573; U.S. Pat. No. 5,500,807; C. 

5 Verlinde, Curr. Biol., 2:577-587 (1994); and I. D. Kuntz, Science, 257: 1078- 
1082 (1992). The model building techniques and computer evaluation systems 
described herein are not a limitation on the present invention. 

Thus, using these computer evaluation systems, a large number of 
compounds may be quickly and easily examined, and expensive and lengthy 

10 testing avoided. Moreover, the need for actual synthesis of many compounds is 
effectively eliminated. 

All publications, patents and patent applications are incorporated herein 
by reference. While in the foregoing specification this invention has been 
described in relation to certain preferred embodiments thereof, and many details 
1 5 have been set forth for purposes of illustration, it will be apparent to those skilled 
in the art that the invention is susceptible to additional embodiments and that 
certain of the details described herein may be varied considerably without 
departing from the basic principles of the invention. 



60 



WHAT IS CLAIMED: 

1 . A method for identifying a new drug effective against at least one drug- 
resistant variant of a target protein, comprising: 

(a) selecting at least one drug-resistant variant of a target protein in 
the presence of a drug; 

(b) obtaining the three dimensional structure of at least one variant 
site of interaction with the drug; 

(c) obtaining the three dimensional structure of the target protein site 
of interaction with the drug; 

(d) comparing the three dimensional structures of the target and 
variant sites of interaction to identify structural similarities and 
differences; and 

(e) designing a new drug to interact with at least one of die variants 
by using the structural similarities or differences. 

2. The method of claim 1, wherein the three dimensional structures of the 
target and variant sites of interaction are determined using x-ray 
crystallography, NMR, or molecular modeling. 

3. The method of claim 1 that further comprises contacting the new drug 
with the target protein or with at least one chug-resistant variant thereof 
to compare die interaction of the new drug with the target protein and 
drug-resistant variant thereof. 

4. The method of claim 3, wherein the interaction of the new drug with the 
target protein or with at least one drug-resistant variant thereof is 
compared by assaying the function of the target protein or the drug- 
resistant variant thereof in the presence of the drug. 

5. The method of claim 3, wherein the interaction of the new drug with the 
target protein or with at least one drug-resistant variant thereof is 
compared by co-crystallizing die new drug and the target protein or the 
drug-resistant variant thereof. 

6. The method of claim 1 , wherein at least one drug-resistant variant of the 
target protein is isolated from a patient who is or was undergoing 
treatment with the drug. 
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7. The method of claim I , wherein at least one drug-resistant variant of the 
target protein is selected by at least one cycle of directed evolution. 

8. The method of claim 7, wherein the directed evolution is accomplished 
by using random mutagenesis, DNA analogs or DNA shuffling. 

5 9. The method of claim 8, wherein the directed evolution is performed in a 
hypermutator organism. 

10. The method of claim 1 , wherein the new drug is designed to interact with 
both the target protein and at least one drug-resistant variant of the target 
protein. 

10 11. The method of claim 1, wherein the new drug is designed to interact with 
regions of at least one drug-resistant variant that are structurally different 
from analogous regions in the target protein. 

12. The method of claim 1, wherein the new drug is designed to inhibit the 
folding of one or more regions of the drug-resistant variants) or the 

15 target protein. 

1 3 . The method of claim 1 , wherein the drug is an antibiotic, an enzyme 
inhibitor, a protein inhibitor, an antibiotic resistance inhibitor, a herbicide 
resistance inhibitor, an insect repellant, an insecticide resistance inhibitor, 
a viral drug resistance inhibitor, a chemotherapeutic resistance inhibitor, 

20 a modulator, an antagonist, an agonist, an effector, a ligand, an antibody, 

an antibody fragment, a peptide or a small molecule. 

14. The method of claim 1 3, wherein said modulator interacts with the drug- 
resistant variants) of the target protein, thereby rendering the drug- 
resistant variant susceptible to said drug. 

25 15. The method of claim 13, wherein said modulator interacts with the dnig- 
resistant variants) of the target protein, thereby rendering the drug- 
resistant variant susceptible to a new drug. 

16. The method of claim 1, wherein the new drug is an antibiotic, an enzyme 
inhibitor, a protein inhibitor, an antibiotic resistance inhibitor, a herbicide 
30 resistance inhibitor, an insect repellant, an insecticide resistance inhibitor, 

a viral drug resistance inhibitor, a chemotherapeutic resistance inhibitor, 
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a modulator, an antagonist, an agonist, an effector, a ligand, an antibody, 
an antibody fragment, a peptide or a small molecule. 

1 7. The method of claim 1 6, wherein the new drug is selected from a 
database of inhibitors or ligands. 

5 18. The method of claim 16, wherein said database consists of small 
molecules. 

19. The method of claim 1 6, wherein the new drug is a combination of at 
least two peptides or two small molecules. 

20. The method of claim 1 , wherein the new drug is designed using structure 
10 based drug design (SBDD), virtual ligand screening, ligand screening, 

high-throughput screening and ultra high-throughput screening. 

2 1 . The method of claim 1 , wherein at least one drug-resistant variant has at 
least one mutation resulting in a structural change relative to the target 
protein. 

1 5 22. The method of claim 1 , wherein at least one drug-resistant variant has a 
combination of two or more mutations resulting in a structural change 
relative to the target protein. 

23. The method of claim 1 , wherein at least one drug-resistant variant has at 
least one mutation that results in a change of stability relative to the 

20 target protein. 

24. The method of claim 1, wherein at least one drug-resistant variant has at 
least one global suppressor mutation that reverses die destabilizing 
effects of another antibiotic or inhibitor-resistant mutation. 

25. The method of claim 1 , wherein the target protein is a bacterial protein, 
25 viral protein, microbial protein, fungal protein, insect protein, cellular 

protein contributing to the development of cancer or cellular protein 
involved in inflammation or autoimmune disease. 

26. The method of claim 1 , wherein the target protein is from Haemophilus 
influenzae, Staphylococcus aureus y Streptococcus pneumoniae, 

30 Enterococcus faecalis, Escherichia coli. Salmonella, Pseudomonas 

aeruginosa, Bacillus anthracis, Yersiniia pestis, Francisella tularensis, 
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Brucella, Coxiella burnetii, staphylococci, Clostridium botulinum, 
Bordetella pertussis, Neisseria meningitidis Z2491 , Streptococcus 
mutan, Klebsiella pneumoniae, Chlamydia pneumoniae, Chlamydia 
trachomatis, Heliobacter pylori, Mycoplasma pneumoniae, Legionella 
5 pneumophila, Legionella micdadei y Treponema pallidum, Toxoplasma 

gondii, Borrelia burgdorferi, Campylobacter jejuni, Listeria 
monocytogenes, Neisseria gonnorhea, Neisseria meningitidis, Rickettsia 
prowazekii, Mycoplasma genitalium, Mycobacterium tuberculosis, 
Streptococcus pyogenes, Vibrio cholerae, Trypanosoma cruzi, 

10 Cryptosporidium parvum, human immunodeficiency virus (HIV), 

cytomegalovirus (CMV), hepatitis C virus (HCV), herpes simplex virus 
(HSV), bovine diarrhea virus, human rhinovirus (HRV), hepatitis B virus 
(HBV), influenza virus, variola virus (smallpox), viral encephalitides, 
arena viridae virus, bunyaviridae virus, filoviridae virus, flaviviridae 

1 5 virus, and filamentous fungi (Fusarium, Myro tectum, Cephalosporium, 

Trichoderma, Verticimonosporium, Stachybotrys species). 

27. The method of claim 1 , wherein the target protein is an enzyme, a 

transcription factor, a DNA polymerase, an RNA polymerase, a reverse 
transcriptase, a protease, a ribosomal protein or a DNA binding protein. 

20 28. The method of claim 1 , wherein the target protein is a Topo IV B subunit 
(ParE), a gyrase B subunit, a ^-lactamase, a penicillinase, a 
cephalosporinase, a choramphenicol acetyltransferase, an aminoglycoside 
modifying enzyme, a D-ala D-ala ligase (ddl), a Peptide deformylase, an 
FabI, a GlmU, a MurA, a MurB, a MurC, a MurD, a FtsZ, a Dam 

25 methylase, an emr/mmr multidrug resistance protein, a bacterial DNA 

methylase, a squalene synthase, a human immunodeficiency virus reverse 
transcriptase, a human immunodeficiency virus protease, a human 
immunodeficiency virus integrase, a human immunodeficiency virus 
TAT/TAR protein, a human immunodeficiency virus NCp7 protein, a 

30 feline immunodeficiency virus protease, a CCR5/CXCR4 protein, a 

cytomegalovirus polymerase, an HCV NS3 helicase, an HCV NS3-4A 
serine protease, an HSV protease, an HSV TS protein, a Bovine diarrhea 
virus NS5B protein involved in RNA synthesis, an HRV 3C protease, an 
HBV polyphosphate kinase, an HMG Co-A reductase, a Chitin synthase- 

35 1 , a D-glucan synthase, an inositol phosphorylceramide synthase, an EF- 

2 protein, a neuraminidase, a topoisomerase, a thrombin, a Factor Xa, a 

cathepsin B, C, S or K, a caspase, a matrix metalloproteinase, a cyclin- 
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dependent kinase, a urokinase, an EGFR tyrosine kinase, a Janus kinase 
family (JAK), a STAT protein, a Bcl-2 protein, a receptor family tyrosine 
kinase insulin receptor kinase, a protein-tyrosine phosphatase, a Src 
family kinase, a Lck family kinase, a ZAP/SYK family kinase, a TEC 
5 family tyrosine kinase, an angiotensin-converting enzyme, a monoamine 

oxidase, a tetrahydrofolate reductase or a pS3 protein. 

29. The method of claim 1, wherein the target protein is Beta-lactamase and 
the drug is a Beta-lactam antibiotic. 

30. The method of claim 29, wherein the drug is cefotaxime. 

10 31. The method of claim 1 that further comprises identifying a second new 
drug effective against at least one second generation drug-resistant 
variant comprising: 

(a) selecting at least one second generation drug-resistant variant by 
exposing the drug-resistant variant of claim 1 to the new drug; 

IS (b) obtaining the three dimensional structure of at least one second 

generation site of interaction with the new drug; 
(c) comparing the three dimensional structures of the second 

generation variant and the variant sites of interaction with the 
drug to identify structural similarities and/or differences; and 

20 (d) designing a second new drug to interact with the second 

generation variant by using the structural similarities or 
differences. 

32. The method of claim 1 , wherein SAR or another structure/function-based 
algorithm is used to design die new drug. 

25 33. The method of claim 1 , wherein said at least one drug-resistant variant(s) 
is two to nine variants. 

34. The method of claim 1, wherein said at least one drug-resistant variants) 
is about ten or more variants. 

35. The method of claim 1 , wherein said at least one drug-resistant variant(s) 
30 is about 1 00 or more variants. 

36. The method of claim I, wherein said at least one drug-resistant variants) 
is about 1000 or more variants. 
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A method for preparing a library representing a drug resistance evolution 
profile of a target protein or target protein fragment comprising: 

(a) generating a first population of sequence or structure variants 
of said target protein by one or more rounds of a first method 
of directed evolution in the presence of said drug; 

(b) generating at least one other population of sequence or 
structure variants by one or more further rounds of said first 
method of directed evolution or by at least one other method 
of directed evolution or by obtaining a library of natural 
variant isolates; 

(c) comparing sequences or structures or fragments of said 
variants in said first population to the sequences or structures 
or fragments of said variants in said at least one other 
population to identify variants having unique sequences or 
structures; and 

(d) preparing a library comprising said unique sequence variants 
that represents a drug resistance evolution profile of said 
target protein or target protein fragment. 

A library composed of drug resistant variants obtained by the method of 
claim 37, in which the library is profiled by primary amino acid 
sequence. 

A library composed of drug resistant variants obtained by the method of 
- claim 37, in which the library is profiled by three-dimensional structure. 

A data structure for storing information on directed evolution of a series 
of drug resistant variants comprising: 

(a) a series of separate variant data fields generated by directed 
evolution, each variant data field comprising each x, y and z 
atomic coordinate of each drug resistant variant; 

(b) a target protein data field comprising each x, y and z atomic 
coordinate of the target protein; 



66 



PCT/US02/06238 



(c) a structural difference data field comprising the differences in x, y 
and z atomic coordinates for each drug resistant variant relative to 
the x, y and z atomic coordinates of the target protein; and 

(d) a series of drug resistance data fields, each drug resistance data 
field comprising each x, y and z atomic coordinate of each drug 
that is resistant to a drug resistance variant 

A method for identifying a new drug effective against at least one drug- 
resistant variant of a target protein comprising screening die library 
produced by the method of claim 37 with at least one new drug. " 

A method for identifying a new drug effective against more than one 
drug-resistant variant of a target protein comprising screening the library 
produced by the method of claim 37 with at least one new drug. 

A new drug identified by the method of claim 40 or 4 1 . 

A method for identifying regions in a target protein that are structurally 
altered within a drug-resistant variant of the target protein, comprising 
comparing the structures of the variants of the drug resistance evolution 
profile produced by the method of claim 37. 

A library composed of drug resistant variants produced by the method of 
claim 37, wherein said library is composed of at least 10 variants. 

The library of claim 44, wherein said library is composed of at least 100 
variants. 

The library of claim 44, wherein said library is composed of at least 1000 
variants. 

A computer-assisted method for identifying a new drug effective against 
at least one drug-resistant variant of a target protein, comprising: 

(a) obtaining the three dimensional structure of at least one variant 
site of interaction with the drug by using computer processing to 
generate an x, y and z coordinate for each atom in die variant site 
of interaction; 

(b) obtaining the three dimensional structure of the target protein .site 
of interaction with the drug by using computer processing to 
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generate an x, y and z coordinate for each atom in the target 
protein site of interaction; 

(c) measuring the differences in distance between the x, y and z 
coordinates for each atom in the variant and target protein sites of 

5 interaction; 

(d) designing a new drug to interact with the site of interaction of at 
least one of the variants by using computer processing to fit a 
library of potential new drugs into the variant and target protein 
sites of interaction and determining which potential new drug fits 

10 into the variant site of interaction better than into the target 

protein site of interaction. 

49. An article of manufacture having instructions stored on it that cause a 
computing system to identify a new drug effective against at least one 
drug-resistant variant of a target protein, the instructions comprising: 
1 5 (a) instructions for comparing a series of separate variant data fields 

with a target protein data field to generate a structural difference 
data field comprising the differences in x, y and z atomic 
coordinates for each drug resistant variant relative to the x, y and 
z atomic coordinates of the target protein, wherein each variant 
20 data field within the series of separate variant data fields 

comprises each x, y and z atomic coordinate of each drug 
resistant variant, and wherein a target protein data field comprises 
each x, y and z atomic coordinate of the target protein; 
(b) instructions for generating a new drug data field from the 
25 structural difference data field and a small molecule data field, 

wherein the small molecule data field comprises x, y and z atomic 
coordinates for a series of small molecules and wherein the new 
drug data field comprises small molecules that can optimally fit 
within the site of interaction of the drug-resistant variant 

30 SO. A new drug identified using the article of manufacture of claim 49. 



68 



WO 02/068933 



PCT/US02/06238 



51 . A method for identifying a new antibiotic effective against at least one 
antibiotic-resistant of bacteria, comprising: 

(a) selecting at least one antibiotic-resistant strain of bacteria by 
exposing the bacteria to a selecting antibiotic; 
5 (b) obtaining the three dimensional structure of a target protein site of 

interaction with the antibiotic, wherein the target protein is within 
the bacteria and wherein the antibiotic has its effect through the 
target protein; 

(c) obtaining the three dimensional structure of a target protein site of 
10 interaction with the antibiotic, wherein the target protein is within 

the bacteria and wherein the antibiotic has its effect through the 
target protein; 

(d) obtaining the three dimensional structure of a variant protein site 
of interaction with the antibiotic, wherein the variant protein is 

1 5 within at least one antibiotic-resistant strain of bacteria selected in 

step (a) and wherein the variant protein is resistant to the selecting 
antibiotic; 

(e) comparing the three dimensional structures of the target and 
variant sites of interaction to identify structural similarities and 

20 differences; and 

(f) designing a new drug to interact with at least one of the variants 
by using the structural similarities or differences. 



52. A system, comprising: 
25 a computer system; 

memory accessible to the computer system; 

a first data structure stored in the memory comprising three dimensional 
structural data of a target protein; 

a second data structure stored in the memory comprising a first 
30 population of three dimensional structural data for a series of variants of the 
target protein, wherein the variants were generated by one or more rounds of 
directed evolution of the target protein in the presence of a drug; 



69 



WO 02/068933 



PCT/US02/06238 



a third data structure stored in the memory comprising a second 
population of three dimensional structural data for a series of second generation 
variants of a first variant, wherein the second generation of variants is generated 
by one or more rounds of directed evolution of the first variant in the presence of 
5 a drug; 

a fourth data structure comprising three dimensional structural data for a 
population of potential new drugs comprising antibiotics, enzyme inhibitors, 
protein inhibitors, antibiotic resistance inhibitors, herbicide resistance inhibitors, 
insect repellants, insecticide resistance inhibitors, viral drug resistance inhibitors, 
10 chemotherapeutic resistance inhibitors, modulators, antagonists, agonists, 

effectors, ligands, antibodies, antibody fragments, peptides or small molecules; 

a first software component operable on the computer system to compare 
the first data structure with die second or third data structure and to identify three 
dimensional structural variants that represent a drug resistance evolution profile 
15 of said target protein; 

a second software component operable on the computer system to 
compare the fourth data structure with the drug resistance evolution profile of 
said target protein and thereby to design a new drug effective against at least one 
drug resistance variant 

20 
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SEQUENCE LISTING 

<110> The Scripps Research Institute 
5 Stevens, Raymond C. 

Orencia, Maria Celilia 

<120> Small molecule design and use of crystallographic structures for 
development of new antibiotics 

10 

<130> 1361.006WO1 

<150> US 60/272,248 
<151> 2001-02-28 

15 

<160> 4 

<170> FastSEQ for Windows Version 4.0 

20<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

25<220> 

<223> A primer 

<400> 1 

atgagtattc aacatatgtt cctggtcgcc 

30 

<210> 2 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

35 

<220> 

<223> A primer 



<400> 2 

40ttaccaatgc ttggatccaa tcagtgaggc 
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<210> 3 
<211> 861 
<212> DNA 

<213> Klebsiella pneumoniae 

5 

<400> 3 



-v f «• j». 4» — * ^ V- /-^ 




L.y L^utv 1. 1» 


attecctttt ttacocjcatt 


ttgccttcct 


60 


gtXtcXgCCC 


aCCCayaadC 


/r t* t* a st £• A 


ahaAaaaato ctaaaaatca 




120 


eg oy ty y y t u 




yy a l> v»aat~ 


aacaQtaaoa tccttcraQaa 


ttttcgcccc 


180 


lOgaagaacgtt 


ttccaatgat 


gagcactttt 


aaagtxctgc taugcggcgc 


(jytilLLotLu 


240 


cgtgttgacg 


ccgggcaaga 


gcaactcggt 


cgccgcatac actattctca 


gaatgacttg 


300 


gttaagtact 


caccagtcac 


agaaaagcat 


cttacggatg gcatgacagt 


aagagaatta 


360 


tgcagtgctg 


ccataaccat 


gagtgataac 


actgcggcca acttacttct 


gacaacgatc 


420 


ggaggaccga 


aggagctaac 


cgcttttttg 


cacaacatgg gggatcatgt 


aactcgcctt 


480 


I5gatcgttggg 


aaccggagct 


gaatgaagcc 


ataccaaacg acgagcgtga 


caccacgacg 


540 


cctgcagcaa 


tggcaacaac 


gttgcgcaaa 


ctattaactg gcgaactact 


tactctagct 


600 


tcccggcaac 


aattaataga 


ctggatggag 


gcggataaag ttgcaggacc 


acttctgcgc 


660 


tcggcccttc 


cggctggctg 


gtttattgct 


gataaatctg gagccagtga 


gcgtgggtct 


720 


cgcggtatca 


ttgcagcact 


ggggccagat 


ggtaagccct cccgtatcgt 


agttatctac 


780 


20acgacgggga 


gtcaggcaac 


tatggatgaa 


cgaaatagac agatcgctga 


gataggtgcc 


840 


tcactgatta 


agcattggta 


a 
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<210> 4 
<211> 286 
25<212> PRT 

<213> Klebsiella pneumoniae 



<400> 4 

Met Ser lie Gin His Phe Arg Val Ala Leu He Pro Phe Phe Ala Ala 
30 1 5 10 15 

Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 

20 25 30 

Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr He Glu Leu Asp 
35 40 45 

35Leu Asn Ser Gly Lys He Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 55 60 

Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 

Arg Val Asp Ala Gly Gin Glu Gin Leu Gly Arg Arg He His Tyr Ser 
40 85 90 95 
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3 

Gin Asn Asp Leu Val Lys Tyr Ser Pro Val Thr Glu Lys His Leu Thr 

100 105 HO 

Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala He Thr Met Ser 
115 120 125 

5Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr He Gly Gly Pro Lys 
130 135 140 

Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145 150 155 160 

Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala He Pro Asn Asp Glu Arg 
10 165 170 175 

Asp Thr Thr Thr Pro Ala Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 

180 185 190 

Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu He Asp Trp 
195 200 205 

15Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
210 215 220 

Ala Gly Trp Phe He Ala Asp Lys Ser Gly Ala Ser Glu Arg Gly Ser 
225 230 235 240 

Arg Gly He He Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg He 
20 245 250 255 

Val Val He Tyr Thr Thr Gly Ser Gin Ala Thr Met Asp Glu Arg Asn 

260 265 270 

Arg Gin He Ala Glu He Gly Ala Ser Leu He Lys His Trp 
275 280 285 

25 
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